《周幸幸 文献翻译.doc》由会员分享,可在线阅读,更多相关《周幸幸 文献翻译.doc(18页珍藏版)》请在taowenge.com淘文阁网|工程机械CAD图纸|机械工程制图|CAD装配图下载|SolidWorks_CaTia_CAD_UG_PROE_设计图分享下载上搜索。
1、单位代码 01 学 号 分 类 号 TP391 密 级 文献翻译基于相邻帧的差别预测的多分辨率小波域运动估计和补偿 院(系)名称信息工程学院 专业名称计算机科学与技术 学生姓名周幸幸 指导教师王学春2011年3月26日英语译文基于相邻帧的差别预测的多分辨率小波域运动估计和补偿唐国伟摘要:针对高比特率的运动矢量编码和占用更多时间负载的全部搜索策略,提出了基于相邻帧的差别预测的多分辨率小波域运动估计和运动补偿算法。图像序列使用微分运动检测方法,并选择适当的阈值来确定相连区域。接下来提取出运动的区域来进行运动估计和运动补偿。实验结果表明,该算法提高了运动矢量编码的效率,降低了运动估计的复杂度,在相同
2、的比特率的情况下,重构图像具有更好的质量,多分辨率运动估计算法(MRME)得以改善。关键词:运动估计;运动补偿;多分辨率分析;视频编码1、引 言 由于小波分析的时频局部化性能,被广泛应用于图像/视频编码领域。正如图像序列,运动估计和运动补偿可以有效地减少相对的时间并且有效地改进编码。然而传统的运动补偿小波编码,以其运动估测的结构再加上帧的内部静止图像编码,不能充分利用固有的多分辨率特性的优势。在1992年,Yaqin Zhang和S.Zafar提出可变方块多分辨率运动估测的fMRME1视频压缩算法,即1aid的运动估测和运动补偿基础是在在小域中。通过使用相对较小的搜索窗口和匹配块,这种方法可以
3、有效的减少运算量,摆脱方块效应,并有易于使视频可扩展编码适合人类视觉系统并实现进步的过渡。但是在MRME算法中,存在着不连续的运动矢量的默认值和实物边界和方块边界的不一致这导致了高频率在系数变换中的成分增加,并影响位移帧的差DFD的编码。Zan Jinwen提出通过多分辨率运动估测中值滤波,可产生更平稳的运动领域,产生一个更好估计性能。但中值滤波会在不平稳高分辨率的子带运动中带来相当负面的影响Su141作了理论推断,深入研究小波系数的插补算法并提出半像素多分辨率运动补偿,有效提高运动估计的准确性。为了克服了离散的小波变换不断转移的属性。Zhang提出基于2声道高通讯滤波和子带的小波域运动估测适
4、应中央搜索点预测.此方法有一个相当低的计算复杂度,但编码器的性能也从PSNR值的数据中减少了。Cagnazzo 所研究的基于视频编码的小波理论性优化准则,提出一个最佳运动估计和补偿的方法,但是此方法是以扩展一个复杂性的成本为代价的法,但在一个扩展的COM成本复杂性。 本文提出了一种帧差的邻接预测MRME算法(即FDMRME)它采用差动检测图像序列,并提取出运动区域进行运动估计和补偿。这种方法减少运动估测的复杂性,提高了运动矢量的编码的高效性,并提出在同一比特率情况下,作为MRME的图像重建的质量。2、基于帧差的运动检测(1) 三帧差算法运动检测包括光流的算法,背景消除算法,相邻三帧差法。利用连
5、续三幅图像,使差异化操作并进行AND运算导致出不同的结果,三帧差算法可以快速从图像序列中检测出运动区域。检测过程如图1所示。图像差 g1(x,y)二值化图像差e g2(x,y)二值化提取运动对象去噪和连通区域标识AND操作kk+1LL3 帧k-1e k-1图1 用三帧差算法对运动对象的检测过程g1(x,y)是前两个帧的运动图像的变化,g2(x,y)是后两个帧。g1(x,y)和g2(x,y)都包含有运动的信息。使两个运动变化图像二值化,并为他们进行AND运算以获得运动对象。(2) 不同图像的阈值选择为了提取运动目标,有必要选择合适的阈值T作为框架。不同的图像g1(x,y)和 g2(x,y) 通过
6、使用基于灰度特征的方法,然后通过阈值T使帧差图像二值化。阈值选择的过程包含4个步骤。第一、在三帧图像中,将第一和第二帧分成2X2的块。在第二帧上每块增加4个像素以获得总和a,然后每个相应的块加上的第一帧的4个像素以获得bi。这里m,n分别代表图像的长和宽,k是22个块的数量。第二、通过使用阈T=1.1S,使第一第二帧的帧差图像二值化,得到二值化图像。在这一步骤中,获得一个粗略的阈值,进而可以得到一个被罚阈值。第三、在帧差图像中,计算像素的平均值小于阈值T。q是指在帧差图像中像素数量小于阈值T。以M为第二和第一帧的帧差图像的阈值,然后使帧差图像二值化。第四、通过阈值M使第三和第二帧的帧差图像二值
7、化然后在帧差图像中计算出像素的平均值小于阈值M。以平均值作为第三和第二帧的帧差图像的阈值,然后二值化帧差图像。(3) 识别所连接的区域和运动区域的坐标的提取标注出AND运算图像以通过使对象聚类的方法得出运动区域的坐标,包括以下2个步骤。第一、标注出每个对象的像素从左到右,从上到下扫描AND运算的图像。每当遇到一个属于目标区域的像素(灰度级为1),就检测该像素的相邻的8个像素。如果这8个像素没有被标注出来,就用和一个新的标签号码标注他们(从1开始),每一次的数量增加1。否则就把当前像素的标签号码标注成这8个相邻像素中最小的编号。第二、群集每个对象区域从头到尾一行一行的扫描运动物体图像(从左到右,
8、然后由右至左)。每当遇到一个对象的像素,检测该像素的8个相邻的像素。如果这8个中最小的标签号比该像素的标签号大,它就会被这8个中小的标签号所取代。当整个图象已经扫描完之后,再从后到前以同样的方式再扫描一下的图像,直到所有对象像素的标签不再变化为止。当AND差异图像已被对象聚类过程处理过之后,头肩图像可能有一些异常区域。降噪这些异常区域并把它们作为图像的运动区域保存下来。3、差异相邻块运动估计和补偿第1步,把3级小波分解到图像序列,并进行运动检测,通过使用三帧差法,以得出最低频率子带LL3。提取运动区域,并把他们分成2 2个方块。假设每块的运动矢量是V3(x,y),其他3子带的运动矢量为也(x,
9、y)。定义一个可靠的标记R。如果不是所有的像素在方块所共享的同一状态,那么就可以断定这个块所在的运动物体边界是不可靠的运动估计。第2步,检查当前方块低频率子带相应的像素标志,如果他们属于静止区域则R= 1且运动矢量是0,这并不需要估计。而在其他子带对应位置的运动矢量也为0。第3步,如果所有的LL3当前方块属于运动区域。则R= 1,据相邻方块运动的预测,块(x,y)与相邻方块之间的关系如图2所示。 a1的值是0或1。如果相应的方块位于同一地区的当前块,那么这个值是1。否则则是0。确定是否超过预测值,绑定与否。如果不是,那么就把预测值当成中心开始搜索。为了保持一致性,把平均绝对差(MAD)作为匹配
10、准则。1234 当前块图2 当前块R=1以及相邻方块的略图第4步,如果有一个对象,它的像素即是它的运动像素,同时也是当前方块的像素。那么可以推断当前方块位于运动区域边界处。该运动是不可靠,它不能被当成是可靠的方块来进行处理。为了促进了预测的可靠性,需要知道更多的信息。所以这些随后会来处理。当第一个扫描完成后,所有可靠的方块运动矢量已实现。然后,毗邻不可靠方块的可靠方块的运动矢量可以被用来于作出预测。当前方块R = 0时与相邻方块之间的位置关系如图3所示。12345678 当前块图3 当前块R=0以及相邻方块的略图第5步,通过参照帧LL3子带使用方块运动矢量,在其他子带中估计相应的方块。对于m(
11、m 3)子图像,方块(x,y)的运动矢量的初始值是 。重复第3步和第4步,可以得出运动估计预测值是 。第6步,因为每个方块(x,y)在每个子图像中,所以最后的运动矢量 的可以得出来。因此,在方块中,任何像素(x,y)的运动补偿预测不仅取决于这个方块的运动矢量,同时也取决于相邻方块的运动矢量。在固定的比特率的情况下,更多的比特可以分配到剩余的信息,以提高重建图像的质量。与此同时。由于估计只针对运动区域, 用于估计的时间消耗要少得多。对于运动区域这是成正比的,而不是对于整个图像。4、实验结果分析该实验条件如表1所示。克莱尔100帧 和美国小姐分别进行了测试。表.2显示运动矢量的字节(由运动矢量,字
12、节/帧表示)和在MRME/ MRMC补偿法和FDMRME方法中所消耗的运动估计的总时间(表示总时间,秒)通过利用的子带取向小波分解的选择性,使用FDMRME算法,改进低频率子带的基运动矢量的准确性。每个子带的运动矢量的错误率下降,这些证明了运动矢量的编码的有效性是有目共睹的。在CIF序列中美国小姐所消耗的时间比在QCIF序列中克莱尔所消耗的时间少。通过使用运动检测,视频帧分为运动区域,静止区域。编码器不会使运动估计和补偿成为静止区域,所以消耗的时间是由运动区域而不是由整个图像决定的,这导致系统效率的改进。表1 实验条件视频序列格式化帧 速度(F/s)低频率子块搜索最小分辨率的范围Claire
13、Miss AmericaQCIF CIF10102*22*28*810*10表2 运动估计值对比视频序列MRMEFDMRME运动矢量总时间运动矢量总时间Claire345.289.86296.538.76Miss America298.0810.97243.646.92表3显示了FDMRME方法的编码结果,用来测试序列。这里PSNR表示重建图像的质量,运动矢量代表的意思和上面的一样。ER(字节/帧)表示经过运动补偿之后,用于传输错误图像的字节数。TOTAL(字节/帧)表示ER字节和运动矢量的总字节数。从表3得出,在一定的比特率情况下,在FDMRME算法中,用于编码的运动矢量的比特要少得多。所以
14、更多的比特可以分配到运动补偿后的残差,以提高重建图像的质量。表3 编码结果对比视频序列MAREFDMAREPSNRER总和PSNRER总和Clair(BR/CR:118/65)39.123698402438-9439024125MissAmerica(R/CR:125/50)34.463903421635.93399644635、结论变量方块多分辨率运动估计和补偿是基于视频编码领域实现小波视频编码高效率的重要途径。通过分析MRME方法出现的问题,本文提出了通过运动检测的运动矢量部分以及运动估计过程的指导,促进了运动矢量估计的一致性和准确性,提高了编码效率,同时,降低了运动估计的复杂度。摘自:唐
15、国伟基于帧差的相邻预测的多分辨率小波域运动估计和补偿J电子学报(英文版)2009.5附:英文原文Multi-resolution Motion Estimation and Compensation based on Adjacent Prediction of Frame Difference in Wavelet DomainTang GuoweiAbstract:Aiming at the higher bitrate occupation of motion vector encoding and more time load of fullsearching strategies,
16、a multi-resolution motion estimation and compensation algorithm based on adjacent prediction of frame difference was proposed. Differential motion detection was employed to image sequences and proper threshold was ad opted to identify the connected region. Then the motion region was extracted to car
17、ry out motion estimation and motion compensation on itThe experiment results show that the encoding efficiency of motion vector is promoted, the complexity of motion estimation is reduced and the quality of the reconstruction image at the same bit-rate as Multi-Resolution Motion Estimation(MRME)is i
18、mproved.Key words: Motion estimation; Motion compensation; Multi-resolution analysis; Video codingI. IntroductionFor the excellent properties of time-frequency localization, wavelet analysis is widely used in the field of image/video coding. As to image sequences, motion estimation and motion compen
19、sation can effectively reduce the time-relativity and improve the encoding efficiency. But the traditional motion compensation wavelet coding, taking the structure of motion estimation plus intra-frame still image encoding, can not make full use of the advantages of the inherent multiresolution char
20、acteristics of wavelet decompositionIn 1992Zhang and S.Zafar proposed variable block Multi-Resolution Motion Estimation MRME algorithm for video compression which 1aid the foundation for motion estimation and compensation in wavelet domain.By using comparatively small searching window and matching b
21、lock, this method can reduce the amount of operation effectively, get rid of the blocking artifacts and be easy to achieve video scalable encoding suitable for human visual system and progressive transmission. But in the MRME algorithm there exist the defaults of discontinuous motion vector and the
22、inconformity of real object border with block border, which leads to the increase of high frequency components in transform coefficients and affects the encoding efficiency of Displacement Frame Difference DFD.Zan Jinwen proposed multi resolution motion estimation through median filtering which can
23、produce more smooth motion fields and result in a better estimation performance. But median filtering brings about a quite negative effect on the unsmooth motion of high frequency sub-band in high resolution. Y.C.Su made a theoretical deduction and a deep research to interpolation algorithm of wavel
24、et coefficients and proposed half-pixel multi-resolution motion compensation which improves the accuracy of motion estimation effectively. To overcome the shift-variant property of discrete wavelet transform, Lei Zhang proposed wavelet-domain motion estimation based on 2-channel high pass filtering
25、and subband-adapted central searching point predictionThis method has a rather low calculated complexity, but the performance of the coder is also reduced from the PSNR data. M.Cagnazzo studied the theoretical optimal criterion of wavelet-based video coding and proposed an optimal motion estimation
26、and compensation method, but at the cost of an augmented complexity.This paper presents a Frame Difference adjacency prediction MRME algorithm (FDMRME), which adopts differential motion detection for image sequence and extracts the motion region to carry out motion estimation and compensation. This
27、method reduces the complexity of motion estimation, improves the encoding efficiency of motion vector and raises the quality of the reconstruction image at the same bit-rate as MRME.II. Motion Detection Based On Frame Difference1. Three frame difference method Motion detection includes optical flow
28、algorithm, background elimination algorithm, adjacent frame algorithm and three frame difference algorithm. By using continuous three images to make difference operation and carry out AND operation to the difference results, the three frame difference algorithm can quickly detect the motion region f
29、rom image sequences. The detection procedure is shown in Fig.1.LL3 frame k-1Difference image g1(x,y)BinatizationDifference image g2(x,y)BinatizationExtraction of motion objectsDenoising andIdentification ofconnected regionAND operationkk+1Fig.1 The procedure of motion object detection using three fr
30、ame difference methodg1(x ,y)is the motion variation image of the first two frames, and g2(x ,y )is that of the later two frames. The motion information is included in both of g1(x ,y)and g2(x ,y). Binarize the two motion variation images and make AND operation for them to obtain the motion objects.
31、 2. Threshold selection of difference imagesIn order to extract motion objects. It is necessary to select proper threshold T for the frame difference images gl(x ,y)and g2(x ,y)by using a gray feature based approach and then to binarize the frame difference images by T . The process of thres- hold s
32、election consists of 4 steps:(1) Among the three frame images. Divide the first and the second frames to 2x2 blocksAdd the 4 pixels of each blocks in second frame to get a sum a , and add the 4 pixels of each corresponding blocks in first frame to get bi.Here mn are respectively the length and the w
33、idth of the image, and k is the number of the 22 blocks.(2) To binarize the frame difference image of the second and the first frames by using threshold T = 1.1S and get the binarization image. In this step a rough threshold is obtained on which a fined threshold can be obtained.(3) Compute the mean
34、 value of the pixels less than the threshold T in the frame difference image.Here q is the number of the pixels less than the threshold T in the frame difference image. Take M as the threshold of the frame difference image of the second and the first frames and then binarize the frame difference ima
35、ge.(4) To binarize the frame difference image of the third and second frames by threshold M. And then compute the mean value of the pixels less than M in the frame difference image. Take the mean value as the threshold of the frame difference image of the third and second frames and then binarize th
36、e frame difference image.3. Identification of the connected region and extraction of the coordinates of motion region. To label the AND operation images to get the coordinates of the motion region by using object clustering approach, which includes 2 steps:(1) Label each object pixels Scan the AND o
37、peration images from left to right and from top to bottom. When a pixel (gray-level is 1) belonging to object region is met, detect the 8 neighbors of this pixe1. If they are not labeled, label them with a new label number (start from 1), and each time the number increases by 1. Otherwise label the
38、current pixel with the smallest label number of the 8 neighboring pixels.(2) Cluster each object regionScan the motion object image from top to bottom line by line (from left to right and then from right to left).when an object pixel is met, detect the 8 neighbors of this pixe1. If the smallest labe
39、l number of them is greater than that of the object pixe1. It is substituted for the smallest labe1 number of the 8 neighboring pixels. When the whole image has been scanned., scan the image once more from bottom to top in the same way until all the labels of the object pixels stop changing.For head
40、-shoulder images there might be some irregular regions after the AND difference image has been processed by the object clustering procedure. Denoise these irregular regions and save them as the motion regions of the image.IIIDifference Adjacent Block Prediction Motion Estimation and CompensationStep
41、1 Make 3 level wavelet decomposition to the image sequences and conduct motion detection to the lowest frequency sub-band LL3 by using three frame difference method. Extract the motion regions and divide them into 22 blocks. Suppose the motion vector of each block as V3( x ,y),and the other 3 sub-ba
42、nd motion vectors are also ( x ,y). Define a reliability flag R. If not all the pixels in theblock share the same state, then it can be determined that this block lies in the border of the movement object which is not regarded as reliable to motion estimation.Step 2 Check the corresponding flags of
43、the pixels of the current block in 1ow frequency sub-band. If they belong to still region then R = 1. And the motion vectors are 0, which do not need to be estimated. And the motion vectors of the corresponding position in other sub-bands are also 0. Step 3 If all the current blocks of LL3 belong to
44、 motion region. Then R = 1. Predict according to the motion of the adjacent bocks. The relation between the block at( x ,y)with the adjacent bocks are shown in Fig.2.The motion estimation values are:Here the value of a1 is 0 or 1. If the corresponding block is located at the same region with the cur
45、rent block, then the value is 1. Otherwise it is 0. Determine whether the prediction value exceeds the bound or not. If not, then start searching bytaking the prediction value as the center. In order to promote the consistency, the Mean Absolute Difference (MAD) is taken as the matching criterion.12
46、34 Current blockFig.2 The sketch of current block R=1 and the adjacenciesStep 4 If there axe both motion pixels and still pixels in the current block. Then it can be inferred that the current block is located at the border of the motion region. The motion is less reliable and it can not be processed as the reliable blockIn order to promote the reliability of the prediction, more information needs to be obtained. So it will be processed later. W hen the first scanning is finished, all the motion vectors of reliable blocks have been achie