FEATURE-BASED ROBUST 3-DRS MOTION ESTIMATION FOR HIGH DEFINITION FRAME RATE UP-CONVERSION

Similar documents
An Improved 3DRS Algorithm for Video De-interlacing

Motion Compensated Frame Interpolation Using Motion Vector Refinement with Low Complexity

Classifying Useful Motion Vectors for Efficient Frame Rate Up Conversion of MC-DCT Encoded Video Streams *

A deblocking filter with two separate modes in block-based video coding

Low-cost Multi-hypothesis Motion Compensation for Video Coding

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

Motion Estimation using Block Overlap Minimization

Multiframe Blocking-Artifact Reduction for Transform-Coded Video

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

Noise filtering for television receivers with reduced memory

Texture based block partitioning method for motion compensated frame interpolation

HYBRID DCT-WIENER-BASED INTERPOLATION VIA LEARNT WIENER FILTER. Kwok-Wai Hung and Wan-Chi Siu

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Decoding-Assisted Inter Prediction for HEVC

CONTENT ADAPTIVE SCREEN IMAGE SCALING

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS

In the name of Allah. the compassionate, the merciful

Rectification of distorted elemental image array using four markers in three-dimensional integral imaging

An Adaptive Cross Search Algorithm for Block Matching Motion Estimation

Motion Estimation for Video Coding Standards

Enhanced Hexagon with Early Termination Algorithm for Motion estimation

A Symmetric Motion Estimation Method for Motion-Compensated Frame Interpolation

Fast Motion Estimation for Shape Coding in MPEG-4

FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS. Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo

AN EFFICIENT VIDEO WATERMARKING USING COLOR HISTOGRAM ANALYSIS AND BITPLANE IMAGE ARRAYS

Redundancy and Correlation: Temporal

An Edge Based Adaptive Interpolation Algorithm for Image Scaling

Advanced De-Interlacing techniques with the use of Zonal Based Algorithms

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation

Image contrast enhancement based on local brightness and contouring artifact improvement for large-scale LCD TVs

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Affine SKIP and MERGE Modes for Video Coding

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Hybrid Video Compression Using Selective Keyframe Identification and Patch-Based Super-Resolution

An Approach for Reduction of Rain Streaks from a Single Image

Fast and Effective Interpolation Using Median Filter

Effects Of Shadow On Canny Edge Detection through a camera

A Spatial Prediction-Based Motion-Compensated Frame Rate Up-Conversion

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Motion estimation for video compression

A reversible data hiding based on adaptive prediction technique and histogram shifting

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Sample Adaptive Offset Optimization in HEVC

Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials

Error Concealment Used for P-Frame on Video Stream over the Internet

Automatic Shadow Removal by Illuminance in HSV Color Space

Vehicle Detection Method using Haar-like Feature on Real Time System

Fingerprint Mosaicking by Rolling with Sliding

Implementation and analysis of Directional DCT in H.264

An Efficient Inter-Frame Coding with Intra Skip Decision in H.264/AVC

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

A New Fast Motion Estimation Algorithm. - Literature Survey. Instructor: Brian L. Evans. Authors: Yue Chen, Yu Wang, Ying Lu.

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations

FOR THE LAST few decades, many image interpolation

High Performance Hardware Architectures for A Hexagon-Based Motion Estimation Algorithm

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Low Bitrate Video Communications

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

FOR compressed video, due to motion prediction and

Learning based face hallucination techniques: A survey

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Homogeneous Transcoding of HEVC for bit rate reduction

AN ADJUSTABLE BLOCK MOTION ESTIMATION ALGORITHM BY MULTIPATH SEARCH

Novel Iterative Back Projection Approach

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression

An Efficient Saliency Based Lossless Video Compression Based On Block-By-Block Basis Method

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Appearance-Based Place Recognition Using Whole-Image BRISK for Collaborative MultiRobot Localization

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

An Efficient Mode Selection Algorithm for H.264

Fast HEVC Intra Mode Decision Based on Edge Detection and SATD Costs Classification

EE 5359 MULTIMEDIA PROCESSING. Implementation of Moving object detection in. H.264 Compressed Domain

CMPT 365 Multimedia Systems. Media Compression - Video

MOTION VECTOR REFINEMENT FOR FRUC USING SALIENCY AND SEGMENTATION

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

FAST: A Framework to Accelerate Super- Resolution Processing on Compressed Videos

Reduction of Blocking artifacts in Compressed Medical Images

LOW BIT-RATE INTRA CODING SCHEME BASED ON CONSTRAINED QUANTIZATION AND MEDIAN-TYPE FILTER. Chen Chen and Bing Zeng

Light Field Occlusion Removal

IMPROVED RHOMBUS INTERPOLATION FOR REVERSIBLE WATERMARKING BY DIFFERENCE EXPANSION. Catalin Dragoi, Dinu Coltuc

Department of Game Mobile Contents, Keimyung University, Daemyung3-Dong Nam-Gu, Daegu , Korea

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

IN RECENT years, multimedia application has become more

A HIGHLY PARALLEL CODING UNIT SIZE SELECTION FOR HEVC. Liron Anavi, Avi Giterman, Maya Fainshtein, Vladi Solomon, and Yair Moshe

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

A hardware-oriented concurrent TZ search algorithm for High-Efficiency Video Coding

ADAPTIVE INTERPOLATED MOTION COMPENSATED PREDICTION. Wei-Ting Lin, Tejaswi Nanjundaswamy, Kenneth Rose

High-Throughput Parallel Architecture for H.265/HEVC Deblocking Filter *

An Approach for Real Time Moving Object Extraction based on Edge Region Determination

Fast Block-Matching Motion Estimation Using Modified Diamond Search Algorithm

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

Edge Detector Based Fast Level Decision Algorithm for Intra Prediction of HEVC

Constrained PDF based histogram equalization for image constrast enhancement

Transcription:

FEATURE-BASED ROBUST 3-DRS MOTION ESTIMATION FOR HIGH DEFINITION FRAME RATE UP-CONVERSION 1 JUNGWON RYU, 2 SONGHYUN YU, 3 JECHANG JEONG 2 Department of Intelligent Robotics, 1,3 Department of Electronics and Computer Engineering. 1,2,3 Hanyang University, Seoul, Korea E-mail: 1 jungwoncf@hanyang.ac.kr, 2 3069song@naver.co.kr, 3 jjeong@hanyang.ac.kr Abstract- Frame Rate Up-conversion(FRUC) is essential to display a low frame rate to high frame rate video signals within the limited bandwidth. In this paper, we propose the FRUC of high definition(hd) video based on feature-based robust3-d recursive search(3-drs) motion estimation(me)by focusing the problem that 3-DRS algorithm can propagate some wrongly estimated motion vectors(mvs) between original frames. First, we set a feature point patch in previous frame through the features from accelerated segment test(fast) detector by using adaptive thresholding process(atp). Second, to calculate the initial MV, we implement the uni-directional ME from a feature point patch to corresponding block in current frame. Third, the weighted average value of feature point vectors is adopted for initial MV, then,perform the robust 3-DRS ME based on bi-directional by using initial MV to reduce the error propagation and get a true motion. Last, the overlapped block motion compensation(obmc) is performed for getting better visual quality and reducing the blocking artifacts. Experimental results of proposed algorithm show that the performance subsequently improves that of conventional methods. Indexterms- Frame rate up-conversion, Motion estimation, Robust 3-D recursive search, Features from accelerated segment test. I. INTRODUCTION Frame rate up-conversion(fruc) is the technique that shows a low frame rate video signals to high frame rate video signals within the limited bandwidth by generating interpolated frames between original frames. In addition, FRUC can reduce the motion blur in a hold-type display, such as a liquid crystal displays (LCDs) by increasing the frame rate and this technique is also used for film to video conversion. The earliest method of FRUC was that simply duplicates the original frame or averaging its previous and successive frames. However, these methods has a serious drawback in the video sequences with intricate motions. Although these methods work well with stationary or slow motion images, motion fluctuation is detected when the image has large motion. These FRUC methods do not consider motion information between original frames. In order to alleviate disadvantages of previous methods, various FRUC algorithms have been proposed. Among them, motion-compensated frame rate up-conversion(mc-fruc) based on block matching algorithm(bma)[1] is the most popular for reducing the motion fluctuation. MC-FRUC estimates the motion vectors(mvs) between the reference blocks in previous frame and processing blocks in current frame using sum of absolute differences(sad) in ME process. After that,motion compensated interpolation (MCI) is performed to interpolate the new generating frames in the corresponding position of motion trajectory.in conclusion,me process is the most crucial point in MC-FRUC algorithm. In other words, the accuracy and consist of MV is very important issue. There are three types of ME process in FRUC: unidirectional ME by using full search[2], bi-directional ME by using full search[3], and 3-D recursive search(3-drs) ME[4], [5], commonly. As in [2], unidirectional ME algorithm is performed through backward or forward direction by utilizing SAD in search area. Because this algorithm finds the MV between original frames in only one direction. However when generating the interpolated frame, it can occur holes and overlapped areas. Contrary to uni-directional ME, the bi-directional ME algorithm find the MV from the interpolated frame to original frames using sum of bi-directional absolute differences(sbad). It can obtain any holes and overlapped areas. Nevertheless, this method also caused the incorrect MV because of the spurious or false temporal symmetry in background of frame. To improve these demerits, several true ME algorithms have been proposed. One of the most popular true ME methods for FRUC is 3-DRS as in [4], [5]. 3-DRS algorithm is performed to produce a smooth and accurate MV field by using the spatial, temporal, and update candidates.in other words, instead of full search algorithm, 3-DRS limits the number of MVs to those already estimated for reducing the risk of reaching a wrong minimum.it is calculated by using two steps. First, calculate the SAD value between candidate set and current block. Second, select current MV from the candidate MV using minimum SAD value. This 3-DRS algorithm is very efficient and suited for the implementation for a real-time system. However, in 3-DRS ME process, once the MV of a block is wrongly estimated, it tends to be propagated to the not estimated blocks in the interpolated frame. It is called error propagation of 3- DRS ME. In this paper, we propose novel 3-DRS ME algorithm to compensate above issues. The section is organized as follows. The structure of the proposed FRUC 6

algorithm is illustrated in Section II. In Section III, we propose simple and adaptive algorithm about new robust 3-DRS ME based on the FAST detector for reducing error propagation and generating correct MV field. In Section IV, the experimental results are represented through peak signal-to-noise ratio(psnr), structural similarity(ssim), and subjective visual quality. Finally, concludes this paper in Section V. II. OVERALL STRUCTUREOFPROPOSEDALGORITHM Figure 1 shows the overall framework of the proposed FRUC algorithm. First, obtain the corner point that is feature point through FAST detector using adaptive thresholding process (ATP) in the previous frame. It is because the feature point has important information in original frame. Second, set the feature patch centered at feature point in the previous frame, respectively. Third, the unidirectional ME is performed from previous to current frame. Then, to calculate the 3-DRS ME, uni-to-bi directional conversion is implemented. After that, the robust 3-DRS ME is performed by following raster scan order. Finally, get the interpolated frame by OBMC [11]. pixel in the image. Second, set the appropriate threshold value. Third, set the circle consisting of 16 pixels around processing pixel. Then, check the circle pixels whether or not contained in the specific condition as described in [6]. If the number of circle pixels of sequentially contained in the rangeare larger than 9,10,11, or 12, processing pixel is determined as temporaryfeature point. Last, fast test based on ID3 decision tree is performed to exclude the unreliable feature points. The ID3 decision tree method is described in [10]. The FAST detector example image and the feature pixel example are shown in [6]. The FAST detector is fast in speed because it compares pixel intensity value using ID3 decision tree method. In addition, the FAST detector can produce a feature points of adequate quality.however, the FAST detector has a serious disadvantage that depends on the threshold, i.e. user parameter.in order to compensate the defect of threshold, ATP is applied in proposed algorithm. ATP is the adaptive difference value between the pixel intensity average value of neighboring 16 pixels around the processing pixel and the processing pixel intensity value.the neighboring 16 pixels around the processing pixel are already described above.we can obtain the superior feature point by checking the adaptive threshold value. Figure 1: Overall flowchart of the proposed algorithm III. FEATURE-BASEDROBUST 3-DRS ME Figure 2: Example of weighted average using feature patch B. Unidirectional Motion Estimation Using Feature Patch and Uni-to-Bi-directional Conversion Next, the 17 17 patch around the feature point is defined as a feature patch, and a frame is divided into 16 16 blocks. The uni-directional ME is performed by using SAD equation (1) between feature patch in previous frame and current frame. A. FAST detector using ATP Sincethe texture information, i.e. important information, is abundant in the local area around the feature point,the feature point detection is extensively used for image/video processing. Among various feature detectors, the FAST detector [6] is proven to be a good performance. The FAST detector is performed as follows. First, select the processing Where (dx, dy) means the MV coordinate, B and B stands for the horizontal and vertical size of block when we find the MV coordinate, respectively. f and f denote the previous and current frames, and 7

Figure 4: The flowchart of robust 3-DRS ME vdenotes the searched MV. As shown in Figure 2, the interpolatedblock is covered with multiple feature patches. Therefore, we determine the weighted value ω by using the proportion of MV in processing 16 16 blocks for pixel unit. Then, the weighted average is calculated byusingω.after obtaining all feature MVs,uni-to-bi-directional conversion is performed because we use theseuni-to-bi-directional MVs for candidate MVs in robust 3-DRS ME. S, T, and F denotes spatial, temporal, and feature candidates, respectively. The feature candidates are used for preventing the error propagation in 3-DRS ME.In addition, if spatial and feature candidates are same position, the feature candidates are selected as the spatial candidates. Because the feature candidates are more powerful than spatial candidates. The flowchart for robust 3-DRS algorithm is given in Figure 4. First, the proposed algorithm checks whether the vectors in primary candidates are exist belonging to the same object.it represents similar motions in fixed range. In other words, primary candidates are consistent with the motion of the current processing block. Therefore, we calculate the difference between each pair of the candidate MVs in the primary candidate set, as in equation (2). Figure 3: Candidate MV sets C. Robust 3-DRS Motion Estimation The robust 3-DRS algorithm is implemented by using the candidate sets. The candidate sets are composed of two different sets of search locations which are applied based on several PSNR checks. Figure 3 represents the candidate MV sets. Figure 3(a) is consists of a small number of search locations including spatial, temporal, and feature candidate sets. In Figure 3(b), the dilatational search location sets are composed of a lot of locations including the zero MV. v, v mean horizontal and vertical components of a candidate MV, respectively. Where V, V represent each pair of the MVs in the primary candidate set. If the V is smaller than predefined experimental parameter 2, it means that the motion associated with neighboring blocks is likely to be almost same as the motion of the current processing block. Therefore, we set the median valuemof the primary candidate of the current block. However, If only median vector has adopted, the incorrect vector filed can be formed 8

because of the recursive behavior of motion vector selection. Thus, we also set the median vector s random update added value, i.e.m + U.The update values are chosen randomly from set U in (4) and are added to median MV to form new candidate. Then, among these vectors, minimum SBAD is selected as current block s MV. SBAD equation is denoted in (3). Finally, when we fill the all MVs of interpolated frame, OBMC [11] is implemented to generate a new interpolated frame. It is substantially reduce the blocking artifacts and provide high visual quality in video applications by using weighted coefficient of bilinear window and implement by overlapping with the neighboring blocks. The proposed algorithm significantly reduces the number of SBAD calculations, i.e. computational complexity, and maintain the PSNR and subjective visual quality by generating a smoother MV field. And also, it prevent error propagation in 3-DRS ME process and preserve the time while keeping PSNR, SSIM, and time. However, if thev is larger than experimental parameter 2, it means that inconsistent MVs is exist around the current block. Thus, MVs in primary candidate set are searched individually. It means minimum SBAD value is calculated for all primary candidate set. Then, if the minimum SBAD value is smaller than the average SBAD of previous frame SBAD h h, the MV of minimum SBAD value is selected as current MV. But, if the minimum SBAD is larger than SBAD h h, the selected MV represents unavailable value, and therefore additional search locations that consisting of 5 temporal candidates and zero motion should be evaluated. These additional candidates are defined as secondary candidate set. If the minimum SBAD values of secondary candidates are smaller than the minimum SBAD of primary candidate set, the MV of minimum SBAD in secondary candidate set is used for current MV. IV. EXPERIMENTAL RESULTS For evaluating the objective performance of the proposed algorithm, PSNR, SSIM, and processing time are used. These are used for evaluating objective quality of FRUC. The experimental results are compared with trilateral filtering (Trilateral) [7] and high definition video application (HDVA) [8], and direction-select algorithm (DS-ME) [9].The reference algorithms of [7], [8] are applied that uni-directional and bi-directional ME. [8] algorithm is one of the true ME algorithms for HD FRUC. Test conditions of experimental result are as follows. High definition (HD) sequences, which include PeopleOnStreet, Traffic, BQTerrace, BasketballDrill, BQMall, BQSquare, FourPeople are used in the experiment. We opted for the 16 16 block size in all cases.when uni-directional ME based onthe feature point is operated, the search range of full search is set to ±32. Table 1 Average of PSNR, SSIM, and Time of Test Sequences As shown in Table1, PSNR, SSIM, and processing time is calculated. It achieves good performance in the average PSNR, SSIM, and time better than DS- ME and Trilateral because of 3-DRS ME by using 3 different candidates of robust 3-DRS algorithm, i.e. spatial, temporal, and feature candidate, to obstruct error propagation in conventional 3-DRS ME algorithm. Although processing time of the HDVA algorithm is similar with proposed, the proposed algorithm shows an overwhelming performance for speed versus PSNR and SSIM ratio.in addition, Figure 5, 6, and 7 show outstanding performance in 9

subjective visual quality of Traffic(2560 1600),BQMall(832 480),and FourPeople(1280 720). Subjective visual quality also can be greatly improved in our proposed algorithm because we generate error-prevention MV field by using feature detection and perform OBMC to remove blocking artifacts. CONCLUSION In this paper, we proposed robust 3-DRS motion estimation with feature point detection, i.e. FAST detector, for FRUC of HD video applications. It is effectively increases the PSNR and SSIM comparing with existing algorithms by generating errorprevention and consistent MV field. The objective and subjective results are showed that the proposed algorithm has good performance because it adaptively use spatial, temporal, and feature candidates through robust criterion in ME process. ACKNOWLEDGEMENT This work was supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(msip) (No.2016-B0101-15- 1377,Development of 16-channel Multi-viewer, H/W Encoder, and Transcoder Systems for 4K UHD Broadcasting) REFERENCES [1] O F. Dufaus and F. Moscheni, Motion estimation techniques for digital TV: A review and a new contribution, Proc. IEEE, vol. 83, pp. 858-876, June. 1995. [2] B. D. Choi, J. W. Han, C. S. Kim, and S. J. Ko, Frame rate upconversion using perspective transform, IEEE Trans. Consumer Electron., vol. 52, no. 3, pp. 975 982, Aug. 2006. [3] S. J. Kang, S. J. Yoo and Y. H. Kim, Dual motion estimation for frame rate up-conversion, IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 12, pp. 1909-1914, Dec. 2010. [4] J. Wang, D. Wang, and W. Zhang, Temporal compensated motion estimation with simple block-based prediction, IEEE Trans. Broadcast., vol. 49, no. 3, pp. 241-248, Sep. 2003. [5] G. de Haan, P. W.A.C. Biezen, H. Huijgen, and O. A. Ojo, True motion estimation with 3-D recursive search block matching, IEEE Trans. Circuits Syst. Video Technol., vol. 3, no. 5, pp. 368-379, Oct. 1993. [6] E. Rosten and T. Drummond, Machine learning for highspeed corner detection, Proc. of the European Conference on Computer Vision, pp. 430-443, 2006. [7] C. Wang, L. Zhang, Y. He, and Y. P. Tan, Frame rate upconversion using trilateral filtering, IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 6, pp. 277-288, Sep. 1998. [8] R. Han and A. Men, "Frame rate up-conversion for highdefinition video application", IEEE Trans. Consum. Electron., vol. 59, no. 1, pp. 229-236, 2013. [9] D. G. Yoo, S. J. Kang and Y. H. Kim, "Direction-select motion estimation for motion-compensated frame rate upconversion", J.Display Technol., vol. 9, no. 10, pp. 840-850, 2013. [10] Chen Jin, Luo De-lin, Mu Fen-xiang, "An Improved ID3 Decision Tree Algorithm," Proc. of 2009 4th International Conference on Computer Science & Education, 2009, pp.127-130. [11] M. T. Orchard and C. J. Sullivan, "Overlapped block motion compensation:an estimation-theoretic approach", IEEE Trans.on Image Process., vol. 3, no. 9, pp. 693-699, 1994. Figure 5: Interpolated results of the 70 th frame of the Traffic(2560 1600) sequence as calculated by (a) Trilateral; (b) HDVA; (c) DS-ME; (d) Proposed. 10

Figure 6: Interpolated results of the 38 th frame of the BQMall(832 480) sequence as calculated by (a) Trilateral; (b) HDVA; (c) DS-ME; (d) Proposed. Figure 7: Interpolated results of the 106 th frame of the FourPeople(1280 720) sequence as calculated by (a) Trilateral; (b) HDVA; (c) DS-ME; (d) Proposed 11