FEATURE. High-throughput Video Data Transfer of Striping with SSDs for 8K Super Hi-Vision

Similar documents
SSD Architecture Considerations for a Spectrum of Enterprise Applications. Alan Fitzgerald, VP and CTO SMART Modular Technologies

Session: Hardware Topic: Disks. Daniel Chang. COP 3502 Introduction to Computer Science. Lecture. Copyright August 2004, Daniel Chang

Solid-state drive controller with embedded RAID functions

I/O CANNOT BE IGNORED

End-to-End Adaptive Packet Aggregation for High-Throughput I/O Bus Network Using Ethernet

COMP283-Lecture 3 Applied Database Management

NVMe: The Protocol for Future SSDs

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Thomas Polzer Institut für Technische Informatik

Linux Software RAID Level 0 Technique for High Performance Computing by using PCI-Express based SSD

Key Points. Rotational delay vs seek delay Disks are slow. Techniques for making disks faster. Flash and SSDs

Storage. Hwansoo Han

LETTER Solid-State Disk with Double Data Rate DRAM Interface for High-Performance PCs

JMR ELECTRONICS INC. WHITE PAPER

Lecture 23. Finish-up buses Storage

Chapter 6. Storage and Other I/O Topics

Administrivia. CMSC 411 Computer Systems Architecture Lecture 19 Storage Systems, cont. Disks (cont.) Disks - review

CS 261 Fall Mike Lam, Professor. Memory

PCI Express SATA III RAID Controller Card with Mini-SAS Connector (SFF-8087) - HyperDuo SSD Tiering

4th Slide Set Operating Systems

3MG2-P Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

3MG2-P Series. Customer Approver. Approver. Customer: Customer Part Number: Innodisk Part Number: Model Name: Date:

A+ Certification Guide. Chapter 12 Storage Devices

Chapter 6. Storage and Other I/O Topics

Solving the Data Transfer Bottleneck in Digitizers

Australian Journal of Basic and Applied Sciences

Mass-Storage Structure

Concepts Introduced. I/O Cannot Be Ignored. Typical Collection of I/O Devices. I/O Issues

Leverage the size and speed of M.2 drives

2x M.2 NGFF SSD RAID Controller Card plus 2x SATA III Ports - PCIe

CSCI-GA Database Systems Lecture 8: Physical Schema: Storage

Chapter-6. SUBJECT:- Operating System TOPICS:- I/O Management. Created by : - Sanjay Patel

3ME2 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

A+ Guide to Hardware: Managing, Maintaining, and Troubleshooting, 5e. Chapter 6 Supporting Hard Drives

3ME2 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

Chapter 6 Storage and Other I/O Topics

Computer Architecture Computer Science & Engineering. Chapter 6. Storage and Other I/O Topics BK TP.HCM

D Demonstration of disturbance recording functions for PQ monitoring

3MG-P Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

3ME Series. Customer Approver. Approver. Customer: Customer Part Number: Innodisk Part Number: Model Name: Date:

CS2410: Computer Architecture. Storage systems. Sangyeun Cho. Computer Science Department University of Pittsburgh

3ME4 Series. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date: Innodisk Approver. Customer Approver

CSE 120. Operating Systems. March 27, 2014 Lecture 17. Mass Storage. Instructor: Neil Rhodes. Wednesday, March 26, 14

A+ Guide to Hardware: Managing, Maintaining, and Troubleshooting, 5e. Chapter 6 Supporting Hard Drives

3ME4 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

Storage Systems. Storage Systems

I/O CANNOT BE IGNORED

Presented by: Nafiseh Mahmoudi Spring 2017

Chapter 9: Peripheral Devices. By: Derek Hildreth Chad Davis

DMP SATA DOM SDM-4G-V SDM-8G-V SDM-16G-V

HP Z Turbo Drive G2 PCIe SSD

Chapter 6. Storage and Other I/O Topics. ICE3003: Computer Architecture Fall 2012 Euiseong Seo

3ME3 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

Introduction I/O 1. I/O devices can be characterized by Behavior: input, output, storage Partner: human or machine Data rate: bytes/sec, transfers/sec

Virtual Memory. Reading. Sections 5.4, 5.5, 5.6, 5.8, 5.10 (2) Lecture notes from MKP and S. Yalamanchili

3MG2-P Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

Disks and RAID. CS 4410 Operating Systems. [R. Agarwal, L. Alvisi, A. Bracy, E. Sirer, R. Van Renesse]

Storage. CS 3410 Computer System Organization & Programming

Chapter 6. Storage and Other I/O Topics. ICE3003: Computer Architecture Spring 2014 Euiseong Seo

3SR-P Series. Customer Approver. Approver. Customer: Customer Part Number: Innodisk Part Number: Model Name: Date:

Adaptec Series 7 SAS/SATA RAID Adapters

3ME3 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

RAID SEMINAR REPORT /09/2004 Asha.P.M NO: 612 S7 ECE

1ME Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

3ME4 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

4 Port PCI Express 2.0 SATA III 6Gbps RAID Controller Card with HyperDuo SSD Tiering

Samsung PM1725a NVMe SSD

Address Accessible Memories. A.R. Hurson Department of Computer Science Missouri University of Science & Technology

Thunderbolt 3 to esata Adapter + USB 3.1 (10Gbps) Port - Mac / Windows

WHITE PAPER. Know Your SSDs. Why SSDs? How do you make SSDs cost-effective? How do you get the most out of SSDs?

Mass-Storage Structure

3ME3 Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

Che-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University

Mass-Storage. ICS332 - Fall 2017 Operating Systems. Henri Casanova

PC-based data acquisition II

3 Port PCI Express 2.0 SATA III 6 Gbps RAID Controller Card w/ msata Slot and HyperDuo SSD Tiering

College of Computer & Information Science Spring 2010 Northeastern University 12 March 2010

Modern RAID Technology. RAID Primer A Configuration Guide

Implementing a Statically Adaptive Software RAID System

Lenovo RAID Introduction Reference Information

Serial ATA (SATA) Interface. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

2 Port PCI Express 2.0 SATA III 6Gbps RAID Controller Card w/ 2 msata Slots and HyperDuo SSD Tiering

FC-NVMe. NVMe over Fabrics. Fibre Channel the most trusted fabric can transport NVMe natively. White Paper

Chapter 12: Query Processing

2-Port PCI Express 2.0 SATA III 6Gbps RAID Controller Card with 2 msata Slots and HyperDuo SSD Tiering

3 Port PCI Express 2.0 SATA III 6 Gbps RAID Controller Card w/ msata Slot and HyperDuo SSD Tiering

Chapter 12: Mass-Storage

Chapter 10: Mass-Storage Systems

3ME Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

FAQs HP Z Turbo Drive Quad Pro

Storage Systems : Disks and SSDs. Manu Awasthi CASS 2018

The devices can be set up with RAID for additional performance and redundancy using software RAID. Models HP Z Turbo Drive Quad Pro 2x512GB PCIe SSD

Chapter 6. Storage & Other I/O

Chapter 10: Mass-Storage Systems. Operating System Concepts 9 th Edition

Intel Rapid Storage Technology (Intel RST) Production Version Release

The Transition to PCI Express* for Client SSDs

3MG-P Series. Customer Approver. Innodisk Approver. Customer: Customer Part Number: Innodisk Part Number: Innodisk Model Name: Date:

P51: High Performance Networking

NVMe Made Affordable. Toshiba OCZ RC100 Series. Vector 180 Series Product Brief. NVMe Solid State Drive (SSD) May 2018

Extending the NVMHCI Standard to Enterprise

Transcription:

High-throughput Video Data Transfer of Striping with s for 8K Super Hi-Vision Takeshi KAJIYAMA, Kodai KIKUCHI and Eiichi MIYASHITA We have developed a high-throughput recording and playback method for full-featured 8K Super Hi-vision (8K) video whose data rate is up to. GBps. The method fully exploits hardware with a specific design for handling large amounts of data, sequential transfer, and frame unit random reads including fast-forward, rewind and jog-shuttle. Steady recording and playback, including frame unit random reads, were demonstrated on an 8K recorder and player.. Introduction 8K Super Hi-Vision (8K) )-5) is an ultrarealistic video and audio system consisting of ultrahigh-definition (UHD) images with megapixels, 6 times that of HD, and. multichannel sound. The throughput required for full-featured, uncompressed 8K video with a frame rate of 0 Hz, quantization of bits, and a full resolution of 7,680 x,0 in green, and blue (RGB) is 8 GBps * ( Gbps). We have previously developed compressed recording equipment using video compression with high-speed signal recording 6)7). This equipment used low compression rates (for high-image quality) so that it could be used for editing and other program production work within broadcast stations, resulting in after-compression transmission rates of. GBps (approx. 0 Gbps, 0 MB per frame) for compressed video data and audio data in :: format * and. GBps (approx. 0 Gbps, 0 MB per frame) for content in ::0 format *. Thus, high-throughput recording and playback memory is required. Solid-state drives (s) 8), which have higher throughput than hard disk drives (HDDs), are attractive as memory for video data at the above rates. s use NAND flash memory 9)*, a solid-state, non-volatile memory, as the recording medium. They are suitable for memory that needs to be impact-resistant and portable with removability. * Bps is a unit indicating the amount of data, in bytes, that can be transmitted in one second. * A high-quality video format that records all color components of the video without chroma subsampling. * A recording format that subsamples color components of the video. The red and blue components have only / of the original amount of information. * A type of non-volatile solid-state memory that is inexpensive and has high capacities as compared to other non-volatile semiconductor memories. Writing and erasing is fast but overwriting individual bytes is slower due to be required complicated procedure inside. However, ordinary s are limited by the Serial ATA 0)*5 (SATA) interface specifications to a maximum throughput of 0.6 GBps, so they cannot be used individually to record compressed 8K. A technology called redundant arrays of inexpensive disks (RAIDs) ), which parallelizes memory, can be used effectively to implement 8K compressed recording using s. Several levels of RAID are defined, depending on the desired recording and playback throughput, recording capacity, and reliability. One of these levels (RAID 0) is called striping, and stores data partitioned into N parts on N separate memories. This has no redundancy, but it has N times the recording capacity and theoretically can achieve N times the recording and playback throughput. However, when RAID is actually applied, it is difficult to achieve both sequential transfer *6 and random transfer *7 performance. In particular, ordinary products incorporating RAID have a strong tendency to emphasize performance for random transfer, and when used for video data recording, which requires high sequential transfer performance, the expected performance is not obtained. Here we propose striping technology and data transfer control methods that can improve the sequential throughput for video with high data rates. This article describes a high-throughput recording technology with a throughput of up to. GBps and up to 0 MB per frame, which considers various data access patterns including sequential writing for recording, sequential reading for playback, and random reading of individual frames for special playback modes such as fast forward, rewind, and using a jog shuttle.. Striping technology This section first discusses examples of striping configurations and performance evaluation. It then explains the causes of throughput bottleneck as the number of parallel s increases, in light of the evaluation results, and de- *5 A high-speed serial interface for connecting computer and memory. Gen, the second-generation specification, is capable of transmission up to 0.75 GBps (0. GBps in practice), and Gen, the third-generation specification, is capable of transmission up to 0.75 GBps (0.6 GBps in practice). *6 A data access pattern in which data is written or read from consecutive addresses. *7 A data access pattern in which data is written or read from addresses that are not consecutive.

scribes our basic concept for improving throughput. *8 A computer expansion bus specification. It is a high-speed serial interface, expandable to up to transmission paths (lanes) according to the specification. In Gen, the secondgeneration specification, each lane can transmit 5 Gbps (0.5 GBps in practice). *9 A high-speed serial interface used to connect computer and memory. A successor specification to the Small Computer System Interface (SCSI) parallel bus connection.. Examples of striping configurations Striping control can be divided into hardware striping and software striping. An example of a hardware striping configuration is shown in Fig.. Hardware striping is provided by a dedicated RAID board and can provide high throughput through a dedicated large-scale integration (LSI) RAID 0 controller, but the maximum number of s that can be used is determined by the product. Generally, the host interface is PCI Express (PCIe) )*8 and the interface is SATA or Serial Attached SCSI (SAS) *9. An example of a software striping configuration is shown in Fig.. With software striping, the RAID 0 controller operates on the host in the software, so throughput depends on the CPU performance and load, but the number of s connected can be increased by using expansion boards that convert the PCIe interface to several SATA or SAS interfaces. In the next section, we discuss the results of investigating the types of transfer bottlenecks that occur and the different characteristics of these two types of striping control.. Evaluating striping performance The theoretical throughput expected with the striping of N parallel s is N times the individual throughput, but in fact, there is a tendency toward saturation as N increases. To examine this phenomenon in detail, we measured the throughput for the hardware striping configuration shown in Fig. using three different s (labeled A, B, and C) from different manufacturers and with different capacities and connector types. We used a RAID board that can accommodate up to eight s. The interface between the host personal computer (PC) and the RAID board was PCIe with the Gen x 8 lane throughput bandwidth (i.e., the maximum throughput) of GBps. The interfaces between the RAID board and s were SATA with the bandwidth per of Gen, i.e., 0.6 GBps. The total throughput bandwidth for eight s was thus.8 GBps. The specifications of the hardware used for the measurements are shown in Table. The relationships between the number of parallel s, N, and the sequential write and sequential read throughputs for hardware striping are shown in Figs. and, respectively. The solid lines are the measured values, while the dotted Host (PC) RAID board RAID 0 controller Host interface PCIe Gen x 8 lanes (Max. GBps) interface SATA Gen (Max. 0.6 GBps per unit) Figure : Example of hardware striping configuration Host (PC) Software RAID 0 Controller Expansion board Host interface PCIe Gen x 8 lanes interface SATA Gen Figure : Example of software striping configuration Table : Hardware specifications Host PC RAID board Expansion board A B C * A smaller SATA connector than that usually used for SATA interfaces.

Sequential write throughput (GBps) Sequential read throughput (GBps) A A (Theoretical values) B B (Theoretical values) C C (Theoretical values) 0 0 8 Number of parallel s (N) Figure : Number of parallel s (N) and sequential write throughput for hardware striping A A (Theoretical values) B B (Theoretical values) C C (Theoretical values) 0 0 8 Number of parallel s (N) Figure : Number of parallel s (N) and sequential read throughput for hardware striping lines are the theoretical values, derived by multiplying the individual throughput by N. The results in Fig. show that for all three brands, the throughput increased up to N= but then saturated for N=8, yielding results much lower than the theoretical value. Figure shows similar results for reading, with saturation when N=8. The maximum throughput for both writing and reading was under GBps, which is less than half of the maximum PCIe throughput of GBps (8 x 0.5 GBps). Also, for B, the throughput for both writing and reading when N=8 was actually less than the throughput for N=, suggesting that the cause was not only the performance limits of the RAID board hardware but was also related to the number of parallel devices, N. Similarly, we measured the throughput for software striping with the configuration shown in Fig.. We used an expansion board that can connect up to 6 s. The interface between the host PC and expansion board was PCIe Gen x 8 lanes, with a bus bandwidth of GBps, the same as in the hardware striping measurements. The interfaces between the expansion board and s were SATA Gen (bus bandwidth 0.6 GBps) with a maximum transfer bandwidth of 9.6 GBps when 6 s are connected. The relationships between the numbers of parallel s, N, used with software striping and the sequential write and read throughputs are shown in Figs. 5 and 6, respectively. Solid lines are measured values and dotted lines are theoretical values, derived by multiplying the individual Sequential write throughput (GBps) Sequential read throughput (GBps) A A (Theoretical values) B B (Theoretical values) C C (Theoretical values) 0 0 8 6 Number of parallel s (N) Figure 5: Number of parallel s (N) and sequential write throughput for software striping A A (Theoretical values) B B (Theoretical values) C C (Theoretical values) 0 0 8 6 Number of parallel s (N) Figure 6: Number of parallel s (N) and sequential read throughput for software striping throughput by N. Figure 5 shows that the measured write throughput for N=8 was much lower than the theoretical value, and for N=6 it was mostly saturated. The maximum write throughput for B was about. GBps when N=6 but was less than GBps for both A and C; all these values were approximately half of the maximum throughput for PCIe of GBps. These results, and the fact that the saturation rates for each case were different, suggest that the cause of the bottleneck was not simply hardware performance limitations. Figure 6 also shows that the read throughput was much less than the theoretical value for N=8 and that the maximum throughput for N=6 under saturation was below GBps for all three drive brands.. Factors causing throughput bottlenecks There are two possible factors resulting in the bottlenecks that cause the throughput saturation discussed in the previous section. The first factor is a limitation on the hardware operating speed, which is the operating speed of the RAID board for hardware striping, and the operating speeds of the expansion board and PC CPU for software striping. These factors depend on specifications such as the hardware operating frequency and bus bandwidth, so such bottlenecks need to be eliminated at the hardware design stage to achieve the desired performance. The second factor is the transmission overhead *0 for the host interface (PCIe) and interfaces (SATA), and is

common to both hardware and software striping. One way to deal with this factor is to increase the hardware performance to.5 times or double, so that the transfer is possible even with the overhead, but when processing high-speed video data such as 8K, increasing the operating frequency and other aspects is not practical. Thus, to avoid bottlenecks, it is important to eliminate the transmission overhead.. Analyzing video data transmission characteristics and improving transmission speed In this section we give the results of analyzing the relationship between throughput and data size, assuming a very large amount of video data, with the goal of eliminating PCIe and SATA interface bottlenecks. Such bottlenecks have been thought to be a cause of the drop in throughput for both hardware and software striping. From the results of this analysis, we describe a data transmission control method for eliminating the transmission overhead and improving throughput. To investigate the relationship between data size and throughput, we built the test board shown in Fig. 7. To eliminate bottlenecks due to the speed of the host PC CPU in our analysis, the test board uses hardware striping. The test board has three field-programmable gate arrays (FPGA), *0 When transmitting data in computers and other devices, processing and procedures are required, in addition to the transmitted data itself. and the low-voltage differential signaling (LVDS) on the internal bus maintains a total bandwidth of GBps. The host interface is the same as that in Figs. and, PCIe Gen x 8 lanes. The interfaces are SATA Gen with up to 0. GBps bandwidth and there are connections for up to 6 s. The test board had msata connecters, which are smaller than regular SATA connectors, and C devices with msata connecters were used in the remainder of this analysis.. Analysis of PCIe transmission characteristics PCIe provides direct memory access (DMA) for the efficient transfer of large-volume data such as video. Figure 8 is a diagram explaining the transmission time for a single frame of video data. R in the figure is the number of DMA transfers needed to transfer one frame of video data, Tstart is the latency * for starting a DMA transfer, Thd is the packet header transfer time, and Tdata is the packet data transfer time. Figure 8 suggests that we can improve the overall transmission time by reducing Tstart, but Tstart depends on the hardware performance of the host controlling the PCIe bus and the board, so this approach would be difficult. Thus, we attempted to increase the data rate by increasing the data size of each DMA transfer, thereby reducing R and the number of times Tstart is needed. To study this effect, we measured the relationship be- * The delay time needed from a data transmission request until the result is returned. Host (PC or recorder) FPGA Host interface PCIe Gen x 8 lanes (Max. GBps) FPGA FPGA Max. 6 in parallel Test board LVDS (max. total GBps) Figure 7: Test board interface SATA Gen (Max. 0. GBps per unit) Video frame transmission time R Time for one DMA transmission T hd T data T start Packet R: Number of repeated DMA transmissions needed to transmit one frame of video data Tstart: Latency to start a DMA transmission Thd: Time to transmit the packet header Tdata: Time to transmit the packet data Figure 8: Video frame transmission time using DMA transmission 5

.6 0.7...0 DMA read DMA write 0.5 0. 0. Read Write.8 0 8 6 0 DMA transmission data size (MB) Figure 9: DMA transmission data size and PCIe throughput 0.9 5,0,56,08,560,07 Number of transmission sectors Figure 0: Number of transmission SATA sectors and throughput tween the DMA transfer data size and PCIe throughput using the test board. The results are shown in Fig. 9. These measurements were taken between the test board and the main memory * of the host PC. The DMA transfer data size is shown on the horizontal axis and the PCIe throughput for sequential transfer is shown on the vertical axis. Figure 9 shows that for a data size of MB, the throughput did not reach GBps, but as the data size increased, so did the throughput, until it surpassed. GBps at 0 MB. This shows that the throughput can be increased by increasing the DMA transfer data size.. Analysis of SATA transmission characteristics We also studied how to improve the throughput for the SATA interfaces in the same way as for the PCIe throughput by increasing the transfer size. s manage data internally in 5-byte units called sectors *, and the number of sectors transferred through the SATA interface for each transfer command can be selected. Thus, we measured the relationship between the number of sectors transferred with each transfer command and the sequential write and read throughputs. The results are shown in Fig. 0. These measurements were carried out using the test board and C devices from Section.. The number of sectors is on the horizontal axis and the throughput for a single is on the vertical axis. From Fig. 0, we see that for both writing and reading, the throughput increases as the number of sectors increases up to 0 sectors, and thereafter it increases more slowly. In these tests, the highest throughput was achieved for,07 sectors, which was 0.5 GBps for writing and 0.6 GBps for reading. These results show that high throughput can be obtained by setting the number of sectors for each transfer command to greater than,0.. Evaluation of random read rates In this section, we verify the relationship between the * Main memory. Extremely fast relative to HDD or but volatile and smaller in capacity. * The smallest unit of recording in a recording medium. Used mainly for disk-based media but also used for s, which have interfaces that are compatible with HDDs. random read data size and throughput. When recording, video data is exclusively written with sequential transfer, but when reading, random transfer in frame units is also used. It is also known that the performance is generally lower for random transfer than for sequential transfer. For these reasons, we evaluated the random read performance. The results of measuring the random read performance using C are shown in Fig.. The data size of random read operations is shown on the horizontal axis and the throughput is shown on the vertical axis. The random read throughput increases quickly with the data size up to near,0 kb, and then continues to increase slowly. This result suggests that by ensuring data sizes of at least,0 kb, good random read performance can be attained. 0.8 0. 0.0 0.6 0. 0,0,08,07,096 Random read data size (kb) Figure : Random read data size and transmission throughput. Video data transmission control to improve transmission throughput Below, we describe a video data transfer control method utilizing the results above to improve throughput. As discussed earlier, the throughput on PCIe and SATA interfaces increases as the amount of transferred data increases, but with video data, random reads in single frame units must also be considered for special playback operations such as fast forward, rewind, and use of the jog shuttle. For these cases, video data must be managed in units that are the same size of or less than the video frame size. Accordingly, we propose a video data transfer method that can 6

Table : Transmission control parameters and transmission throughput Video frame size DF and PCIe DMA transfer rate DF (MB) Throughput Number of parallel s (N) DS and SATA random read throughput DS (kb) Throughput No. of transmission sectors per SATA command and throughput S (no. of sectors) Throughput 0 0 6 8 6 8,80,560 60,80,560 5,0,80,560 handle data in video frame units. To maximize the PCIe throughput, we set the data size transferred between the host and the test board to equal the video frame size, and call it DF. To maximize the SATA throughput, we set the data size transferred between the test board and each of the s, DS, by dividing DF by the number of parallel s, N, so that DS = DF/N. Thus, the number of sectors corresponding to DS is given by S = DS/5, which we take as the maximum number of sectors transferrable through the SATA interfaces in a single command. The results of evaluating throughput using sample values for these parameters are summarized in Table. Parameters that yielded high throughput are indicated with a circle in Table. Bottlenecks in the PCIe and SATA interfaces can be reduced by controlling transfers according to the parameters shown in Table. That is, as shown by the results in Fig. 9, the larger the PCIe DMA transfer size, the more the bottleneck is eliminated and the higher the throughput achieved. By making the DMA transfer size close to the frame size, DF, the bottleneck was eliminated. Specifically, with DF=0 MB, over. GBps was achieved, and with DF=0 MB, over GBps was achieved. The results in Fig. 0 show that the bottleneck can be avoided by setting the number of sectors transferred per SATA command to greater than,0. Note here that the number of sectors transferred, S, must be inversely proportional to the number of parallel memories, N, and under the conditions shown in Table, the number of transferred sectors, S, is,0 or greater, so bottlenecks can be eliminated. For random reads, we can set the parallelism, N, such that the DS value does not drop below,0 kb to achieve high throughput. For the parameters shown in Table, when DF = 0 MB with parallelism N =6, and when DF = 0 MB with parallelism N = 8, DS was,0 or greater, resulting in good performance, but when DF = 0 MB and N = 6, the random read performance was lower.. Evaluating the transmission speed of striping with the proposed method We applied the proposed method using the test board in Fig. 7 and up to 6 C devices, and evaluated the throughput between the host PC and the striped s. Figure shows results of measuring the relation between the striping parallelism, N, and write throughput for a video frame size of DF = 0 MB. The dotted line shows the theoretical values, obtained by multiplying the individual transfer rate by N, and the solid line shows the measured results. The write throughput increased almost proportionally with N, achieving. GBps when N = 6. Figure shows the results of measuring the read throughput under the same conditions. The read throughput also increased with N, achieving.8 GBps at N = 6. These results show that the proposed method reduced the transfer overhead, achieving write throughput nearly proportional to N up to N = 6. Similarly, the method achieved read throughput comparable to the write values, which, while dropping somewhat below the theoretical throughput, continued to increase up to N = 6. One possible reason that the read throughput in Fig. dropped below the theoretical value is as follows. Generally Proposed method Theoretical values 0 0 8 6 Number of parallel s (N) Figure : Number of parallel s and write transmission throughput using the proposed method Proposed method Theoretical values 0 0 8 6 Number of parallel s (N) Figure : Number of parallel s and read transmission throughput using the proposed method 7

an upper limit for the throughput for N stripped s is N times the throughput of the slowest of the N s. Thus, as N increases, the probability that the transfer rate will drop increases. In particular, s have sufficient cache memory to maintain stable write transfer rates, but many products cannot use this cache for reading, and this tendency may be quite noticeable. For these measurements, we used C because we were constrained by the type of connector on the test board, but see similar results are expected for s A and B, which have similar or better individual device performance. 5. Implementation and evaluation in an 8K recording system We implemented the proposed method in a prototype 8K recording equipment and evaluated its recording and playback performance. We compressed full-featured 8K video with a data rate of. GBps, a video frame size of 0 MB, and ::0 format. To record and play back this video data, we used the test board and eight C devices in parallel. Previously, we measured the performance of this configuration using the host PC as shown in Fig. 7 with a set of parameters in Table, DF = 0 MB, N = 8, DS =,80 kbyte, and S =,560, achieving a maximum write throughput of.57 GBps and maximum read throughput of.6 GBps. To verify the stability of the memory, we performed recording (sequential write), normal playback (sequential read), and special playback using fast forward, rewind, and a jog shuttle (random read), and were able to confirm stable operation in all cases. 6. Conclusion We have proposed a high-speed transfer control method for video data and hardware striping to be used as a high-speed storage technology for 8K video. The proposed method eliminates the transmission overhead on the PCIe and SATA interfaces in the striping hardware to achieve high throughput. By using the video frame data size as the basic unit of transmission, the method can realize high throughput for sequential reading and writing for basic recording and playback and also random reading in frame units. The throughput of a conventional RAID 0 controller was largely saturated for eight parallel s, but we were able to confirm increasing throughput up to 6 parallel s using our prototype test board and our proposed method. With 6 parallel disks, we achieved a maximum throughput of over GBps, higher than the. GBps required to record compressed full-featured 8K video. Although we evaluated this method using hardware striping control on a test board, it should also be effective when applied to software striping. We also confirmed that the proposed method is useful for recording, regular playback, and special types of playback for. GBps compressed video data using 8K recorder with eight parallel devices. This article was edited and revised on the basis of the following article, which appeared in ITE Transactions on Media Technology and Applications. T. Kajiyama, K. Kikuchi and E. Miyashita: High-throughput Video Data Access Control of Striping with s for 8K Super Hi-Vision, ITE Transactions on Media Technology and Applications, Vol., No., pp. 86-9 (06) References ) M. Sugawara, M. Kanazawa, K. Mitani, H. Yamashita and F. Okano: Ultrahigh-Definition Video System with 000 Scanning Lines, SMPTE Mot. Imag. J., Vol., No. 0&, pp. 9-6 (00) ) T. Yamashita, K. Masaoka, K. Ohmura, M. Emoto, Y. Nishida and M. Sugawara: Super Hi-Vision: Video Parameters for Next-Generation Television, SMPTE Mot. Imag. J., Vol., No., pp. 6-68 (0) ) SMPTE ST 06-, Ultra High Definition Television-Audio Characteristics and Audio Channel Mapping for Program Production (008) ) SMPTE ST 06-, Ultra-High Definition Television-Image Parameter Values for Program Production (0) 5) ITU-R Recommendation BT.00, Parameter Values for Ultra-High Definition Television Systems for Production and International Program Exchange (0) 6) T. Kajiyama, K. Kikuchi and E. Miyashita: 8K Super Hi- Vision Video Compression Recorder, ITE Technical Report. Media Technol., Vol. 9, No. 7, HI05-7, ME05-7, AIT05-7, MMS05-9, CE05-9, pp. - (05) 7) K. Kikuchi, T. Kajiyama and E. Miyashita: 8K Super Hi-Vision Compression Recorder Using Compact Bit-Rate Control Method, International Workshop on Advanced Image Technology, C-6 (06) 8) R. Micheloni, A. Marelli and K. Eshghi: Inside Solid State Drives (s), Springer Science & Business Media (0) 9) R. Micheloni, L. Crippa and A. Marelli: Inside NAND Flash Memories, Springer Science & Business Media (00) 0) Serial ATA International Organization, Serial ATA Revision.0 (009) ) D. A. Patterson, G. Gibson and R. H. Katz: A Case for Redundant Arrays of Inexpensive Disks (RAID), Proceedings of the 988 ACM SIGMOD International Conference on Management of Data, pp. 09-6 (988) ) PCI-SIG, PCI Express Base Specification Revision.a (05) 8