Using persistent memory to
|
|
- Hilary Martin
- 6 years ago
- Views:
Transcription
1 Using persistent memory to Talk build Title a high-performant, Here fully Author Name, Company user space filesystem Krzysztof Czuryło, Intel
2 What it is? pmemfile Low-overhead userspace implementation of file APIs using persistent memory Open source: BSD license
3 What is is? Fully user-space not FUSE Builds on libpmemobj Specifically designed for Persistent Memory
4 pmem family libpmemblk libpmemlog syscall intercept libpmemfile posix libpmemobj libpmem libvmem libpmemfile vltrace librpmem libvmmalloc antool
5 Motivation Performance when HW is fast, SW becomes a bottleneck move kernel out of the stack Speed up PMEM adoption test your program with pmemfile (w/o any changes) if it performs better, consider rewriting it to use PMEM directly (i.e. libpmemobj)
6 Features Strong consistency/atomicity guarantees metadata and user data could be limited to metadata only Focused on performance takes advantage of NVDIMM bandwidth/latency Fine-grain granularity
7 Design
8 Design libpmemfile-posix syscall-like API can be directly used by applications libpmemfile transparent access to libpmemfile-posix pools thanks to syscall_intercept
9 Design User Applications GNU C Library (glibc) User Space GNU/Linux System Call Interface Kernel Kernel Space Architecture-Dependent Kernel Code Hardware Platform
10 Design User Applications pmemfile_write write pmemfile components libpmemfile-posix libpmemobj libpmemfile syscall_intercept glibc User Space load/store System Call Interface Kernel SYS_write Kernel Space Architecture-Dependent Kernel Code NVDIMM Hardware Platform
11 Design Syscall-like APIs vs. file I/O? hundreds of functions to implement/intercept open => open, open2, open64,... write => write, pwrite, fwrite, fprintf,... what about statically linked libc? what about syscalls issued by the program itself? Drawbacks intercepting system calls is not trivial
12 Intercepting system calls syscall_intercept Provides a low-level interface for hooking Linux system calls in user space Very simple API Open source:
13 Design Build on libpmemobj DAX-enabled filesystem / DAX Device memory-mapped files direct access to persistent memory (load/store) persistent memory allocator replication Fail-safety transactions atomic operations
14 libpmemfile-posix
15 libpmemfile-posix File system for persistent memory Runs in user-space No kernel overhead Interfaces modeled after the corresponding POSIX interfaces for file management about 60 functions: pmemfile_* open, openat, creat, close, link, unlink,..., read, write,... Easier transition for application developers
16 libpmemfile-posix #include <libpmemfile-posix.h> PMEMfile *pmemfile_open(pmemfilepool *pfp, const char *pathname, int flags,...); ssize_t pmemfile_read(pmemfilepool *pfp, PMEMfile *file, void *buf, size_t count); PMEMfilepool - filesystem (pmemfile pool) handle passed to each pmemfile_* function as the first argument PMEMfile - (pmem)file descriptor
17 libpmemfile-posix Multiple root directories multiple, distinct directory trees in one pool one pmemfile pool can handle multiple mounting points unsigned pmemfile_root_count(pmemfilepool *pfp); PMEMfile *pmemfile_open_root(pmemfilepool *pfp, unsigned index, int flags);
18 libpmemfile
19 libpmemfile User space persistent memory file system which is automatically enabled when libpmemfile is pre-loaded Nearly transparent access to persistent memory resident files Intercepts standard Linux glibc interfaces
20 Create filesystem mkfs-pmemfile path size creates pmemfile pool ("filesystem image") path should point to a pmem-aware filesystem or Device DAX $ mkfs-pmemfile /mnt/pmem/myfs 1G or $ mkfs-pmemfile /dev/dax1.0 0
21 Mount pmemfile-mount path mount-point convenient way to "mount" pmemfile pool / filesystem at given location libpmemfile reads mounts at load time $ sudo pmemfile-mount /mnt/pmem/myfs /tmp/mountpoint
22 Mount If pmemfile-mount can't be used i.e. no root privileges PMEMFILE_POOLS=/tmp/mountpoint:/dev/dax0.0' Files seen at /tmp/mountpoint/* are actually stored on filesystem backed by /dev/dax0.0 Syscalls related to those files are transparently redirected to libpmemfile-posix
23 Example $ alias pf='ld_preload=libpmemfile.so' $ alias pf='ld_preload=libpmemfile.so \ PMEMFILE_POOLS=/tmp/mountpoint:/dev/dax0.0' $ pf mkdir /tmp/mountpoint/dir_in_pmemfile $ pf cp README.md /tmp/mountpoint/dir_in_pmemfile $ pf ls -l /tmp/mountpoint/ total 0 drwxrwxrwx 2 user group 4008 Feb 16 17:46 dir_in_pmemfile $ pf ls -l /tmp/mountpoint/dir_in_pmemfile total 16 -rw-r--r-- 1 user group 1014 Feb 16 17:46 README.md $ pf cat /tmp/mountpoint/dir_in_pmemfile/readme.md wc -c $ ls -l /tmp/mountpoint/ total 0 $ ls -l /tmp/mountpoint/dir_in_pmemfile ls: cannot access '/tmp/mountpoint/dir_in_pmemfile': No such file or directory
24 Limitations
25 There are many...
26 Limitations No support for I/O event notification epoll_*, inotify*, poll, select,... No extended attributes All writes are synchronous (it's a feature actually!) no asynchronous I/O flushes are not needed / no-op sync, fsync, fdatasync,... No file locks (flock)
27 Limitations Memory mapping is not supported (yet) mmap, munmap, msync,... Can't execute program binaries stored in pmemfile pool because of mmap... No special files (mknod)... and some other minor issues see libpmemfile man page for details
28 Limitations No multi-process access (or very limited) libpmemobj limitation memory-mapped files (MAP_SHARED) - no COW workaround available (veeeery slow) Works only on Linux x86_64 other *NIX-like systems could be supported syscall_intercept/libpmemobj - work only on x86_64
29 Limitations Limited support for clone() fork() child process has no access to pmem files vfork() not supported No remote replication (not fail-safe)
30 vltrace vltrace Tool for tracing applications and evaluating whether libpmemfile.so supports them (
31 Performance results
32 Results Not much difference for read-only workload Performs well for heavy-write workload small writes appends for large data transfers memcpy is the limit Outperforms ext4+dax up to 2x, depending on the workload
33 Results
34 Q&A
35
36 Backup
37 Limitations Full list of non-supported syscalls chroot getsockname lsetxattr msync select epoll_ctl getsockopt madvise munlock setxattr epoll_pwait inotify_add_watch mknod munlockall swapoff epoll_wait inotify_rm_watch mknodat munmap tee fgetxattr ioctl mmap poll umount2 flistxattr lgetxattr mount ppoll vfork fremovexattr listxattr mprotect pselect fsetxattr lremovexattr mremap removexattr
38 Build and install git clone cd pmemfile mkdir build cd build cmake.. -DCMAKE_INSTALL_PREFIX=/usr make sudo make install cmake.. -DCMAKE_BUILD_TYPE=Debug -DDEVELOPER_MODE=1 \ -DTEST_DIR=/mnt/pmem/pmemfile-tests... ctest --output-on-failure
Impact on Application Development: SNIA NVM Programming Model in the Real World. Andy Rudoff pmem SW Architect, Intel
Impact on Development: SNIA NVM Programming Model in the Real World Andy Rudoff pmem SW Architect, Intel Agenda What everyone already knows about pmem What everyone forgets Ways to use pmem with no app
More informationsyscall_intercept A user space library for intercepting system calls Author Name, Company Krzysztof Czuryło, Intel
Talk syscall_intercept Title Here A user space library for intercepting system calls Author Name, Company Krzysztof Czuryło, Intel What it is? Provides a low-level interface for hooking Linux system calls
More informationAPIs for Persistent Memory Programming
APIs for Persistent Memory Programming MSST 2018 Andy Rudoff NVM Software Architect Intel Corporation Data Center Group A Full-Stack Example Using a key-value store as an example App Unmodified App, uses
More informationTHE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE
THE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE I/O LATENCY WILL SOON EXCEED MEDIA LATENCY 30 NVM Tread 25 NVM xfer Controller
More informationThe SNIA NVM Programming Model: Latest Developments and Challenges. Andy Rudoff, Intel Corporation
The SNIA NVM Programming Model: Latest Developments and Challenges Andy Rudoff, Intel Corporation Programming Model Four meanings (at least) 2 Programming Model: SW Interface to HW Core L1 L1 L2 Core L1
More informationUpdate on Windows Persistent Memory Support Neal Christiansen Microsoft
Update on Windows Persistent Memory Support Neal Christiansen Microsoft 1 Agenda What is Persistent Memory (PM) Review: Existing Windows PM Support What s New New PM APIs Large Page Support Hyper-V Support
More informationNAME attr extended attributes on XFS filesystem objects. SYNOPSIS attr [ LRq ] s attrname [ V attrvalue ] pathname
ATTR(1) XFS Compatibility API ATTR(1) attr extended attributes on XFS filesystem objects SYNOPSIS attr [ LRq ] s attrname [ V attrvalue ] pathname attr [ LRq ] g attrname pathname attr [ LRq ] r attrname
More informationCSE 333 SECTION 3. POSIX I/O Functions
CSE 333 SECTION 3 POSIX I/O Functions Administrivia Questions (?) HW1 Due Tonight Exercise 7 due Monday (out later today) POSIX Portable Operating System Interface Family of standards specified by the
More informationUsing persistent memory and RDMA for Ceph client write-back caching Scott Peterson, Senior Software Engineer Intel
Using persistent memory and RDMA for Ceph client write-back caching Scott Peterson, Senior Software Engineer Intel 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Ceph Concepts
More informationPersistent Memory: The Value to HPC and the Challenges
Persistent Memory: The Value to HPC and the Challenges November 12, 2017 Andy Rudoff Principal Engineer, NVM Software Intel Corporation Data Center Group Intel Persistent Memory New Type of Memory Persistent,
More informationFall 2017 :: CSE 306. File Systems Basics. Nima Honarmand
File Systems Basics Nima Honarmand File and inode File: user-level abstraction of storage (and other) devices Sequence of bytes inode: internal OS data structure representing a file inode stands for index
More informationWindows Support for PM. Tom Talpey, Microsoft
Windows Support for PM Tom Talpey, Microsoft Agenda Windows and Windows Server PM Industry Standards Support PMDK Support Hyper-V PM Support SQL Server PM Support Storage Spaces Direct PM Support SMB3
More informationcodius-sandbox Documentation
codius-sandbox Documentation Release 0.1.0 The Codius Team November 05, 2014 Contents 1 Dependencies 3 1.1 C++ API................................................. 3 1.2 Node.js API...............................................
More informationApplication Fault Tolerance Using Continuous Checkpoint/Restart
Application Fault Tolerance Using Continuous Checkpoint/Restart Tomoki Sekiyama Linux Technology Center Yokohama Research Laboratory Hitachi Ltd. Outline 1. Overview of Application Fault Tolerance and
More informationWindows Support for PM. Tom Talpey, Microsoft
Windows Support for PM Tom Talpey, Microsoft Agenda Industry Standards Support PMDK Open Source Support Hyper-V Support SQL Server Support Storage Spaces Direct Support SMB3 and RDMA Support 2 Windows
More informationCSE 333 SECTION 3. POSIX I/O Functions
CSE 333 SECTION 3 POSIX I/O Functions Administrivia Questions (?) HW1 Due Tonight HW2 Due Thursday, July 19 th Midterm on Monday, July 23 th 10:50-11:50 in TBD (And regular exercises in between) POSIX
More informationIN-PERSISTENT-MEMORY COMPUTING WITH JAVA ERIC KACZMAREK INTEL CORPORATION
IN-PERSISTENT-MEMORY COMPUTING WITH JAVA ERIC KACZMAREK INTEL CORPORATION LEGAL DISCLAIMER & OPTIMIZATION NOTICE INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL
More informationDesign Overview of the FreeBSD Kernel CIS 657
Design Overview of the FreeBSD Kernel CIS 657 Organization of the Kernel Machine-independent 86% of the kernel (80% in 4.4BSD) C code Machine-dependent 14% of kernel Only 0.6% of kernel in assembler (2%
More informationDesign Overview of the FreeBSD Kernel. Organization of the Kernel. What Code is Machine Independent?
Design Overview of the FreeBSD Kernel CIS 657 Organization of the Kernel Machine-independent 86% of the kernel (80% in 4.4BSD) C C code Machine-dependent 14% of kernel Only 0.6% of kernel in assembler
More informationOverview. Over the next four weeks, we will look at these topics: Building Blocks. Advanced Authentication Issues.
Overview Over the next four weeks, we will look at these topics: Building Blocks Advanced Authentication Issues Security Overview Storage and its abstraction Virtualization and appliances Data Replication
More informationProtection and System Calls. Otto J. Anshus
Protection and System Calls Otto J. Anshus Protection Issues CPU protection Prevent a user from using the CPU for too long Throughput of jobs, and response time to events (incl. user interactive response
More informationPersistent Memory over Fabric (PMoF) Adding RDMA to Persistent Memory Pawel Szymanski Intel Corporation
Persistent Memory over Fabric (PMoF) Adding RDMA to Persistent Memory Pawel Szymanski Intel Corporation 1 Adding RDMA to Persisteny memory Agenda PMoF Overview Comparison with other remote replication
More informationBuilding on The NVM Programming Model A Windows Implementation
Building on The NVM Programming Model A Windows Implementation Chandra Konamki Sr Software Engineer, Microsoft Paul Luse Principal Engineer, Intel Open NVM Programming Model NVML Overview Abstraction Value
More informationFlavors of Memory supported by Linux, their use and benefit. Christoph Lameter, Ph.D,
Flavors of Memory supported by Linux, their use and benefit Christoph Lameter, Ph.D, Twitter: @qant Flavors Of Memory The term computer memory is a simple term but there are numerous nuances
More informationOperating Systems. II. Processes
Operating Systems II. Processes Ludovic Apvrille ludovic.apvrille@telecom-paristech.fr Eurecom, office 470 http://soc.eurecom.fr/os/ @OS Eurecom Outline Concepts Definitions and basic concepts Process
More informationApplications of. Virtual Memory in. OS Design
Applications of Virtual Memory in OS Design Nima Honarmand Introduction Virtual memory is a powerful level of indirection Indirection: IMO, the most powerful concept in Computer Science Fundamental Theorem
More informationCS 5460/6460 Operating Systems
CS 5460/6460 Operating Systems Fall 2009 Instructor: Matthew Flatt Lecturer: Kevin Tew TAs: Bigyan Mukherjee, Amrish Kapoor 1 Join the Mailing List! Reminders Make sure you can log into the CADE machines
More informationFailure-Atomic fle updates for Linux
Failure-Atomic fle updates for Linux Christoph Hellwig 1 / 20 Data integrity is hard Writes in Posix are not durable by default: Required a f(data)sync to be persistent Or the O_SYNC / O_DSYNC options
More informationFile Management 1/34
1/34 Learning Objectives system organization and recursive traversal buffering and memory mapping for performance Low-level data structures for implementing filesystems Disk space management for sample
More informationPOSIX Shared Memory. Linux/UNIX IPC Programming. Outline. Michael Kerrisk, man7.org c 2017 November 2017
Linux/UNIX IPC Programming POSIX Shared Memory Michael Kerrisk, man7.org c 2017 mtk@man7.org November 2017 Outline 10 POSIX Shared Memory 10-1 10.1 Overview 10-3 10.2 Creating and opening shared memory
More informationSystems Programming. 09. Filesystem in USErspace (FUSE) Alexander Holupirek
Systems Programming 09. Filesystem in USErspace (FUSE) Alexander Holupirek Database and Information Systems Group Department of Computer & Information Science University of Konstanz Summer Term 2008 Schedule
More informationThe Slow NVM (R)evolution in the context of Next Generation Postprocessing
The Slow NVM (R)evolution in the context of Next Generation Postprocessing 5. December 2016, Stuttgart Workshop on Sustained Simulation Performance Erich Focht NEC HPC Europe The K Problem Two things
More informationBuilding blocks for Unix power tools
for Unix power tools Now that we have given a good overview of a lot of the better Unix tools, I want to take some time to talk about our toolset for building Unix programs. The most important of these
More informationCMPS 105 Systems Programming. Prof. Darrell Long E2.371
+ CMPS 105 Systems Programming Prof. Darrell Long E2.371 darrell@ucsc.edu + Chapter 3: File I/O 2 + File I/O 3 n What attributes do files need? n Data storage n Byte stream n Named n Non-volatile n Shared
More informationNew types of Memory, their support in Linux and how to use them with RDMA
14 th ANNUAL WORKSHOP 2018 New types of Memory, their support in Linux and how to use them with RDMA Christoph Lameter, Ph.D., R&D Team Lead Jump Trading LLC April 5, 2018 Overview Why talk about memory?
More informationEXPLODE: a Lightweight, General System for Finding Serious Storage System Errors. Junfeng Yang, Can Sar, Dawson Engler Stanford University
EXPLODE: a Lightweight, General System for Finding Serious Storage System Errors Junfeng Yang, Can Sar, Dawson Engler Stanford University Why check storage systems? Storage system errors are among the
More informationFiles. Eric McCreath
Files Eric McCreath 2 What is a file? Information used by a computer system may be stored on a variety of storage mediums (magnetic disks, magnetic tapes, optical disks, flash disks etc). However, as a
More informationThread Concept. Thread. No. 3. Multiple single-threaded Process. One single-threaded Process. Process vs. Thread. One multi-threaded Process
EECS 3221 Operating System Fundamentals What is thread? Thread Concept No. 3 Thread Difference between a process and a thread Prof. Hui Jiang Dept of Electrical Engineering and Computer Science, York University
More informationVirtual File System (VFS) Implementation in Linux. Tushar B. Kute,
Virtual File System (VFS) Implementation in Linux Tushar B. Kute, http://tusharkute.com Virtual File System The Linux kernel implements the concept of Virtual File System (VFS, originally Virtual Filesystem
More informationPM Support in Linux and Windows. Dr. Stephen Bates, CTO, Eideticom Neal Christiansen, Principal Development Lead, Microsoft
PM Support in Linux and Windows Dr. Stephen Bates, CTO, Eideticom Neal Christiansen, Principal Development Lead, Microsoft Windows Support for Persistent Memory 2 Availability of Windows PM Support Client
More informationCSCE 548 Building Secure Software Dirty COW Race Condition Attack
CSCE 548 Building Secure Software Dirty COW Race Condition Attack Professor Lisa Luo Spring 2018 Outline Dirty COW vulnerability Memory Mapping using mmap() Map_shared, Map_Private Mapping Read-Only Files
More informationUsing NVDIMM under KVM. Applications of persistent memory in virtualization
Using NVDIMM under KVM Applications of persistent memory in virtualization Stefan Hajnoczi About me QEMU contributor since 2010 Focus on storage, tracing, performance Work in Red
More informationIntroduction. Let s start with the first set of slides
Tux Wars Class - 1 Table of Contents 1) Introduction to Linux and its history 2) Booting process of a linux system 3) Linux Kernel 4) What is a shell 5) Bash Shell 6) Anatomy of command 7) Let s make our
More informationThe bigger picture. File systems. User space operations. What s a file. A file system is the user space implementation of persistent storage.
The bigger picture File systems Johan Montelius KTH 2017 A file system is the user space implementation of persistent storage. a file is persistent i.e. it survives the termination of a process a file
More informationNVthreads: Practical Persistence for Multi-threaded Applications
NVthreads: Practical Persistence for Multi-threaded Applications Terry Hsu*, Purdue University Helge Brügner*, TU München Indrajit Roy*, Google Inc. Kimberly Keeton, Hewlett Packard Labs Patrick Eugster,
More informationECE 650 Systems Programming & Engineering. Spring 2018
ECE 650 Systems Programming & Engineering Spring 2018 Inter-process Communication (IPC) Tyler Bletsch Duke University Slides are adapted from Brian Rogers (Duke) Recall Process vs. Thread A process is
More information<Insert Picture Here> Btrfs Filesystem
Btrfs Filesystem Chris Mason Btrfs Goals General purpose filesystem that scales to very large storage Feature focused, providing features other Linux filesystems cannot Administration
More informationI/O and Syscalls in Critical Sections and their Implications for Transactional Memory
I/O and Syscalls in Critical Sections and their Implications for Transactional Memory Lee Baugh and Craig Zilles University of Illinois at Urbana-Champaign Side-Effects in Transactions begin_transaction();
More informationOMNIO: A Tool for I/O Recording, Analysis and Replay
OMNIO: A Tool for I/O Recording, Analysis and Replay Bryan Flynt Cooperative Institute for Research in the Atmosphere Colorado State University Fort Collins, Colorado USA Mark Govett Advanced Technology
More informationNVDIMM Overview. Technology, Linux, and Xen
NVDIMM Overview Technology, Linux, and Xen Who am I? What are NVDIMMs? A standard for allowing NVRAM to be exposed as normal memory Potential to dramatically change the way software is written But.. They
More informationFiles and File Systems
File Systems 1 Files and File Systems files: persistent, named data objects data consists of a sequence of numbered bytes alternatively, a file may have some internal structure, e.g., a file may consist
More informationEfficient Memory Mapped File I/O for In-Memory File Systems. Jungsik Choi, Jiwon Kim, Hwansoo Han
Efficient Memory Mapped File I/O for In-Memory File Systems Jungsik Choi, Jiwon Kim, Hwansoo Han Operations Per Second Storage Latency Close to DRAM SATA/SAS Flash SSD (~00μs) PCIe Flash SSD (~60 μs) D-XPoint
More informationAsynchronous Events on Linux
Asynchronous Events on Linux Frederic.Rossi@Ericsson.CA Open System Lab Systems Research June 25, 2002 Ericsson Research Canada Introduction Linux performs well as a general purpose OS but doesn t satisfy
More informationOperating System Structure
Operating System Structure Heechul Yun Disclaimer: some slides are adopted from the book authors slides with permission Recap: Memory Hierarchy Fast, Expensive Slow, Inexpensive 2 Recap Architectural support
More informationFile Systems. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University
File Systems Jinkyu Jeong (jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu SSE3044: Operating Systems, Fall 2016, Jinkyu Jeong (jinkyu@skku.edu) File System Layers
More informationCS 1550 Project 3: File Systems Directories Due: Sunday, July 22, 2012, 11:59pm Completed Due: Sunday, July 29, 2012, 11:59pm
CS 1550 Project 3: File Systems Directories Due: Sunday, July 22, 2012, 11:59pm Completed Due: Sunday, July 29, 2012, 11:59pm Description FUSE (http://fuse.sourceforge.net/) is a Linux kernel extension
More informationChapter 4. File Systems. Part 1
Chapter 4 File Systems Part 1 1 Reading Chapter 4: File Systems Chapter 10: Case Study 1: Linux (& Unix) 2 Long-Term Storage of Information Must store large amounts of data Information must survive the
More informationFile Systems. CS170 Fall 2018
File Systems CS170 Fall 2018 Table of Content File interface review File-System Structure File-System Implementation Directory Implementation Allocation Methods of Disk Space Free-Space Management Contiguous
More informationKprobes Presentation Overview
Kprobes Presentation Overview This talk is about how using the Linux kprobe kernel debugging API, may be used to subvert the kernels integrity by manipulating jprobes and kretprobes to patch the kernel.
More informationDeterministic Futexes Revisited
A. Zuepke Deterministic Futexes Revisited Alexander Zuepke, Robert Kaiser first.last@hs-rm.de A. Zuepke Futexes Futexes: underlying mechanism for thread synchronization in Linux libc provides: Mutexes
More informationOverview. Administrative. * HW 2 Grades. * HW 3 Due. Topics: * What are Threads? * Motivating Example : Async. Read() * POSIX Threads
Overview Administrative * HW 2 Grades * HW 3 Due Topics: * What are Threads? * Motivating Example : Async. Read() * POSIX Threads * Basic Thread Management * User vs. Kernel Threads * Thread Attributes
More information1 / 23. CS 137: File Systems. General Filesystem Design
1 / 23 CS 137: File Systems General Filesystem Design 2 / 23 Promises Made by Disks (etc.) Promises 1. I am a linear array of fixed-size blocks 1 2. You can access any block fairly quickly, regardless
More informationCaching and reliability
Caching and reliability Block cache Vs. Latency ~10 ns 1~ ms Access unit Byte (word) Sector Capacity Gigabytes Terabytes Price Expensive Cheap Caching disk contents in RAM Hit ratio h : probability of
More informationContainer Library and FUSE Container File System Softwarepraktikum für Fortgeschrittene
Container Library and FUSE Container File System Softwarepraktikum für Fortgeschrittene Parallele und Verteilte Systeme Institut für Informatik Ruprecht-Karls-Universität Heidelberg Michael Kuhn Matrikelnummer:
More informationStrata: A Cross Media File System. Youngjin Kwon, Henrique Fingler, Tyler Hunt, Simon Peter, Emmett Witchel, Thomas Anderson
A Cross Media File System Youngjin Kwon, Henrique Fingler, Tyler Hunt, Simon Peter, Emmett Witchel, Thomas Anderson 1 Let s build a fast server NoSQL store, Database, File server, Mail server Requirements
More informationOutline. Overview. Linux-specific, since kernel 2.6.0
Outline 25 Alternative I/O Models 25-1 25.1 Overview 25-3 25.2 Signal-driven I/O 25-9 25.3 I/O multiplexing: poll() 25-12 25.4 Problems with poll() and select() 25-29 25.5 The epoll API 25-32 25.6 epoll
More informationRF-IDs in the Kernel -- Episode III: I want to File Away
What s on the menu Software Comprehension and Maintenance June 2005 RF-IDs in the Kernel -- Episode III: I want to File Away Achilleas Anagnostopoulos (archie@istlab.dmst.aueb.gr) Department of Management
More informationLive block device operations in QEMU
Live block device operations in QEMU Paolo Bonzini Red Hat Yokohama, June 2012 1 Outline What is QEMU? The QEMU block layer Live block operations Q&A 2 What is QEMU? A FAST! processor emulator Started
More informationOperating System Structure
Operating System Structure Heechul Yun Disclaimer: some slides are adopted from the book authors slides with permission Recap OS needs to understand architecture Hardware (CPU, memory, disk) trends and
More informationProcesses. Johan Montelius KTH
Processes Johan Montelius KTH 2017 1 / 47 A process What is a process?... a computation a program i.e. a sequence of operations a set of data structures a set of registers means to interact with other
More informationOS Structure. Kevin Webb Swarthmore College January 25, Relevant xkcd:
OS Structure Kevin Webb Swarthmore College January 25, 2018 Relevant xkcd: One of the survivors, poking around in the ruins with the point of a spear, uncovers a singed photo of Richard Stallman. They
More informationINTRODUCTION TO THE UNIX FILE SYSTEM 1)
INTRODUCTION TO THE UNIX FILE SYSTEM 1) 1 FILE SHARING Unix supports the sharing of open files between different processes. We'll examine the data structures used by the kernel for all I/0. Three data
More informationIntroduction Tasks Memory VFS IPC UI. Escape. Nils Asmussen MKC, 07/09/ / 40
Escape Nils Asmussen MKC, 07/09/2015 1 / 40 Outline 1 Introduction 2 Tasks 3 Memory 4 VFS 5 IPC 6 UI 2 / 40 Outline 1 Introduction 2 Tasks 3 Memory 4 VFS 5 IPC 6 UI 3 / 40 Motivation Beginning Writing
More informationKernel Scalability. Adam Belay
Kernel Scalability Adam Belay Motivation Modern CPUs are predominantly multicore Applications rely heavily on kernel for networking, filesystem, etc. If kernel can t scale across many
More informationIntroduction to Unix. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University
Introduction to Unix Jin-Soo Kim (jinsookim@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu What is an OS? OS is a resource manager Sharing Protection Fairness Performance
More informationA process. the stack
A process Processes Johan Montelius What is a process?... a computation KTH 2017 a program i.e. a sequence of operations a set of data structures a set of registers means to interact with other processes
More informationCS5460/6460: Operating Systems. Lecture 24: Device drivers. Anton Burtsev April, 2014
CS5460/6460: Operating Systems Lecture 24: Device drivers Anton Burtsev April, 2014 Device drivers Conceptually Implement interface to hardware Expose some high-level interface to the kernel or applications
More informationRCU. ò Walk through two system calls in some detail. ò Open and read. ò Too much code to cover all FS system calls. ò 3 Cases for a dentry:
Logical Diagram VFS, Continued Don Porter CSE 506 Binary Formats RCU Memory Management File System Memory Allocators System Calls Device Drivers Networking Threads User Today s Lecture Kernel Sync CPU
More informationFS Facilities. Naming, APIs, and Caching OS Lecture 17. UdS/TUKL WS 2015 MPI-SWS 1
FS Facilities Naming, APIs, and Caching OS Lecture 17 UdS/TUKL WS 2015 MPI-SWS 1 Naming Files MPI-SWS 2 Recall: inodes What is an inode?» the data structure of a filesystem representing a byte stream (=
More informationThe cow and Zaphod... Virtual Memory #2 Feb. 21, 2007
15-410...The cow and Zaphod... Virtual Memory #2 Feb. 21, 2007 Dave Eckhardt Bruce Maggs 1 L16_VM2 Wean Synchronization Watch for exam e-mail Please answer promptly Computer Club demo night Thursday (2/22)
More informationVFS, Continued. Don Porter CSE 506
VFS, Continued Don Porter CSE 506 Logical Diagram Binary Formats Memory Allocators System Calls Threads User Today s Lecture Kernel RCU File System Networking Sync Memory Management Device Drivers CPU
More informationDistribution Kernel Security Hardening with ftrace
Distribution Kernel Security Hardening with ftrace Because sometimes your OS vendor just doesn't have the security features that you want. Written by: Corey Henderson Exploit Attack Surface Hardening system
More informationNon-Volatile Memory Through Customized Key-Value Stores
Non-Volatile Memory Through Customized Key-Value Stores Leonardo Mármol 1 Jorge Guerra 2 Marcos K. Aguilera 2 1 Florida International University 2 VMware L. Mármol, J. Guerra, M. K. Aguilera (FIU and VMware)
More informationProject 4: File System Implementation 1
Project 4: File System Implementation 1 Submit a gzipped tarball of your code to CourseWeb. Due: Friday, December 7, 2018 @11:59pm Late: Sunday, December 9, 2018 @11:59pm with 10% reduction per late day
More informationCSCE 313 Introduction to Computer Systems. Instructor: Dezhen Song
CSCE 313 Introduction to Computer Systems Instructor: Dezhen Song Programs, Processes, and Threads Programs and Processes Threads Programs, Processes, and Threads Programs and Processes Threads Processes
More informationFile Descriptors and Piping
File Descriptors and Piping CSC209: Software Tools and Systems Programming Furkan Alaca & Paul Vrbik University of Toronto Mississauga https://mcs.utm.utoronto.ca/~209/ Week 8 Today s topics File Descriptors
More informationOperating System Architecture. CS3026 Operating Systems Lecture 03
Operating System Architecture CS3026 Operating Systems Lecture 03 The Role of an Operating System Service provider Provide a set of services to system users Resource allocator Exploit the hardware resources
More informationCSCE 313: Intro to Computer Systems
CSCE 313 Introduction to Computer Systems Instructor: Dr. Guofei Gu http://courses.cse.tamu.edu/guofei/csce313/ Programs, Processes, and Threads Programs and Processes Threads 1 Programs, Processes, and
More informationArchitectural Support for Operating Systems. Jinkyu Jeong ( Computer Systems Laboratory Sungkyunkwan University
Architectural Support for Operating Systems Jinkyu Jeong ( jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Today s Topics Basic services of OS Basic computer system
More informationMemory management. Single process. Multiple processes. How to: All memory assigned to the process Addresses defined at compile time
Memory management Single process All memory assigned to the process Addresses defined at compile time Multiple processes. How to: assign memory manage addresses? manage relocation? manage program grow?
More informationThreads. What is a thread? Motivation. Single and Multithreaded Processes. Benefits
CS307 What is a thread? Threads A thread is a basic unit of CPU utilization contains a thread ID, a program counter, a register set, and a stack shares with other threads belonging to the same process
More informationSection 3: File I/O, JSON, Generics. Meghan Cowan
Section 3: File I/O, JSON, Generics Meghan Cowan POSIX Family of standards specified by the IEEE Maintains compatibility across variants of Unix-like OS Defines API and standards for basic I/O: file, terminal
More informationCS 326: Operating Systems. Process Execution. Lecture 5
CS 326: Operating Systems Process Execution Lecture 5 Today s Schedule Process Creation Threads Limited Direct Execution Basic Scheduling 2/5/18 CS 326: Operating Systems 2 Today s Schedule Process Creation
More informationPersistent Memory and Media Errors
Persistent Memory and Media Errors Vishal Verma vishal.l.verma@intel.com Vault 2016 1 Or How to have your Poison and (not) consume it too 2 NVDIMM software stack Regular Block IO Application Standard Raw
More informationFile Systems Overview. Jin-Soo Kim ( Computer Systems Laboratory Sungkyunkwan University
File Systems Overview Jin-Soo Kim ( jinsookim@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Today s Topics File system basics Directory structure File system mounting
More informationCISC2200 Threads Spring 2015
CISC2200 Threads Spring 2015 Process We learn the concept of process A program in execution A process owns some resources A process executes a program => execution state, PC, We learn that bash creates
More informationException-Less System Calls for Event-Driven Servers
Exception-Less System Calls for Event-Driven Servers Livio Soares and Michael Stumm University of Toronto Talk overview At OSDI'10: exception-less system calls Technique targeted at highly threaded servers
More informationUserfaultfd: Post-copy VM migration and beyond
Userfaultfd: Post-copy VM migration and beyond Mike Rapoport This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant
More informationLast class: Today: Thread Background. Thread Systems
1 Last class: Thread Background Today: Thread Systems 2 Threading Systems 3 What kind of problems would you solve with threads? Imagine you are building a web server You could allocate a pool of threads,
More informationProcesses. Operating System CS 217. Supports virtual machines. Provides services: User Process. User Process. OS Kernel. Hardware
es CS 217 Operating System Supports virtual machines Promises each process the illusion of having whole machine to itself Provides services: Protection Scheduling Memory management File systems Synchronization
More information