generic-library/vpx

Author	SHA1	Message	Date
James Zern	0f96f02b94	use LC_ALL=C to sort libvpx_*srcs.txt Change-Id: I387da141ebade4fc4d2f3c0a2b6aa5aaea091c0c	2015-02-26 20:48:40 -08:00
James Zern	099049efb3	tools_common.sh: use $$ in VPX_TEST_OUTPUT_DIR a bit simpler than invoking awk for rand() Change-Id: I36ac474708f7bf0157ae59b882c2a9f69b0aaf41	2015-02-26 18:30:24 -08:00
James Zern	f1e0183c81	Merge "tools_common.sh: add directory name to error output"	2015-02-26 15:54:41 -08:00
Marco	c3f7bb16b4	Fix arithmetic overflow warnings. Change-Id: Ib85b5bc135aa0907a76b8c74faafe577e27d014f	2015-02-26 15:27:21 -08:00
Jingning Han	8ec22296b3	Fix high bit-depth loop-filter sse2 compiling issue - part 3 Change-Id: Idb14b9a285f8098126f967c5e2750221d6a58f69	2015-02-26 15:21:22 -08:00
Yaowu Xu	776a6cfd9e	Merge "Fix the encoder to support profile change"	2015-02-26 15:13:28 -08:00
Yaowu Xu	d973fbb1de	Merge "Correct parameter order in a function call"	2015-02-26 15:13:06 -08:00
James Zern	28eebf3e36	Merge "tests: add a shorter 720p test clip"	2015-02-26 14:51:44 -08:00
Jingning Han	73a00d3219	Refactor integral projection based motion estimation Support variable block size integral projection based motion estimation. Change-Id: Iee6d65e44df4480aa13fb7b84b9c91914b89caa1	2015-02-26 14:48:59 -08:00
James Zern	2ebe0aee11	tools_common.sh: add directory name to error output + add a helper function to reduce the duplication this is a bit clearer when the environment variable is set, but the directory is missing Change-Id: I08f9b56122b5741bb40a5f795f7f82f5b49f1047	2015-02-26 12:57:30 -08:00
Jingning Han	14ff1cb74a	Fix high bit-depth loop-filter sse2 compiling issue - part 2 Change-Id: I6728b69bb3dff1daa64ff7142f691e80a089f1c4	2015-02-26 12:41:19 -08:00
Yaowu Xu	754bbcfdc8	Fix the encoder to support profile change Change-Id: Iefb928ad1174e274409facfb44f80265ff0f7683	2015-02-26 11:41:01 -08:00
Yaowu Xu	387bb8bed7	Correct parameter order in a function call Change-Id: Ibd87db1c4371edcbe193d39df2fdc07d3842c21a	2015-02-26 11:39:57 -08:00
paulwilkins	e2b4ef1313	Merge "Account for rate error in GF group Q calculation."	2015-02-26 08:20:08 -08:00
James Zern	7839d0382a	tests: add a shorter 720p test clip niklas_1280_720_30.y4m 60 frames @ 30fps only a small number of frames are being used; this reduces the test data download size in non-perf-test cases by >500M. retain niklas_1280_720_30.yuv for encode+decode perf tests Change-Id: I56b3433104acd462f952a9554280de5a3ec0b6d2	2015-02-25 19:12:03 -08:00
Alex Converse	6ea83fdfcb	Make SVC compatible with external resize. Fixes https://code.google.com/p/webm/issues/detail?id=943 Change-Id: I6177bf6ab6b31a22d2652732f579b8aed3f28887	2015-02-25 14:05:51 -08:00
Jingning Han	3e1d14a6ce	Merge "Motion compensated reference refinement"	2015-02-25 12:33:09 -08:00
Jingning Han	4c5a4efc38	Merge "Re-distribute hierarchical vector match pattern"	2015-02-25 10:33:25 -08:00
Jingning Han	b7050c0be3	Motion compensated reference refinement This commit applies one-step refinement search to the resulting motion vector of the integral projectiion based motion estimation, per 64x64 block. It improves the coding performance of speed -6. pedestrian 1080p 500 kbps 51735 b/f, 36.794 dB, 16044 ms -> 51382 b/f, 36.793 dB, 16282 ms cloud 1080p 500 kbps 24081 b/f, 37.988 dB, 14016 ms -> 23597 b/f, 38.076 dB, 12774 ms vidyo1 720p 1000 kbps 16552 b/f, 40.514 dB, 8279 ms -> 16553 b/f, 40.543 dB, 8510 ms The rtc set compression performance is improved by 0.5%. Change-Id: I3d09bea2caf58b2a4f3b38aa26fffafcbe9a2c17	2015-02-25 10:32:09 -08:00
Yaowu Xu	bcdac7b4be	Merge "Fix a trivial memory leak"	2015-02-25 10:26:44 -08:00
Yunqing Wang	419ff1352e	Merge "Fix ssse3 quantize_fp functions while skip=1"	2015-02-25 10:10:10 -08:00
Jingning Han	2080e4b206	Fix high bit-depth loop-filter sse2 compiling issue - part 1 The intrinsic statement _mm_subs_epi16() should take immediate. Feeding variable as its input argument will cause compile failure in older version gcc. Change-Id: I6a71efcc8d3b16b84715e0a9bcfa818494eea3f4	2015-02-25 09:59:50 -08:00
Jingning Han	0f57d0a682	Merge "Fix fwd transform sse2 build issue on older gcc version"	2015-02-25 09:32:00 -08:00
Jingning Han	e47033319d	Fix fwd transform sse2 build issue on older gcc version Change-Id: I3e0e53d129552babf29e6c5d047483733983973c	2015-02-24 23:25:21 -08:00
James Zern	044bfa3949	Merge "vp9_loopfilter: quiet integer constant size warnings"	2015-02-24 19:09:32 -08:00
Hanno Böck	b5d0a20170	Fix a trivial memory leak Change-Id: I1108d720bb3b30586b128dd01ce608e1e62b1756	2015-02-24 16:06:52 -08:00
Jingning Han	5b87f1bb5a	Fix high bit-depth loop-filter sse2 compiling issue - part 4 Change-Id: I39f56f60425836f2e1ec07da71edd4810a4c78bb	2015-02-24 14:50:30 -08:00
Jingning Han	f87e315e1e	Re-distribute hierarchical vector match pattern This commit modifies the hierarchical vector match patter. It avoids repeated SAD computation at same points. The function vp9_vector_sad_sse2 is called 12 times per 64x64 block, instead of 15 times as before. The effective coverage remains the same. Change-Id: I91ad9d27d40db8963c907d02af84e10702136994	2015-02-24 11:48:38 -08:00
James Zern	279d350f0b	vp9_loopfilter: quiet integer constant size warnings mark uint64_t constants with 'ULL' Change-Id: I7648e161b4004fba35e1fa7ab79e34cc19e39716	2015-02-24 11:13:16 -08:00
Yunqing Wang	58e0159c80	Fix ssse3 quantize_fp functions while skip=1 In ssse3 functions, DEFINE_ARGS macro hard codes qcoeff and dqcoeff to r3 and r4. If skip is 1, qcoeff and dqcoeff need to be loaded from the stack, which doesn't work because of the above definitions. Currently, skip=1 case is not used in the encoder. This patch fixed the issue, so it can be turned on later. Change-Id: I998d696b1a7a85dca2b3bcee790b21c21e039147	2015-02-24 10:37:05 -08:00
Yaowu Xu	6cf3031286	fix the propagation of color space info in decoder This addresses the issue #960 Change-Id: Iddf45b4bd4f53cb0ddfd879e800a071cd843b915	2015-02-23 13:01:14 -08:00
paulwilkins	8d7f53f04c	Account for rate error in GF group Q calculation. When GF group adaptive maxQ is enabled this patch accounts somewhat for accumulated error in the rate control. This improves accuracy quite a bit on many clips especially when there is overshoot. Examples when the overshoot and undershoot command line parameters are set to 100: Hall @ 1200 overshoot is reduced from 67-24%. Akiyo @ 400 undershoot is reduced from 28%-15%. Setting a lower value for undershoot or overshoot still reduces the error further. Impact on metrics is mixed with some gains in average psnr but generally a little lower (e.g. 0.5%) on overall and ssim. The GF group adaptation is still off by default in this patch. Compared to with the head, enabling this mode now gives big average psnr gains on the YT sets (e.g. YT_HD >11.2%), a drop in overall PSNR (YT-HD 3.9%) and a smaller drop or neutral for SSIM. Change-Id: If4b32cd0740d3fb941317b374f9c2951954eee90	2015-02-23 10:57:27 +00:00
Adrian Grange	44adb8e283	Merge "Remove redundant test"	2015-02-20 16:13:55 -08:00
Marco	c9f660d895	Merge "Remove a few unneccessary multiplications in denoiser."	2015-02-20 14:42:02 -08:00
Marco	8f84fbe756	Remove a few unneccessary multiplications in denoiser. Change-Id: I3edbb7cc67203fbbf32c6fd4a08015ca9d9ed53e	2015-02-20 11:55:11 -08:00
Hangyu Kuang	8724d31d12	Move dequant table from VP9_COMMON to VP9_COMP as decoder does not need it any more. This reduces VP9_COMMON size from 25776 bytes to 17584 bytes(~31%). Change-Id: Ic5daea732ccefb6d512b048af7983f0efe08589b	2015-02-20 11:12:42 -08:00
Marco	a1b402e71c	Merge "Adjustments to cyclic refresh (aq-mode=3)."	2015-02-20 09:55:05 -08:00
Jingning Han	6728655422	Merge "Add high bit depth support to rtc sub8x8 block coding"	2015-02-20 09:35:18 -08:00
Marco	0187f4b411	Adjustments to cyclic refresh (aq-mode=3). Target higher delta-qp for big blocks with zero motion, and for segment#1: avoid 64x64 partition size and force 8x8 tx size. Metrics on RTC set mostly positive: SSIM up by ~4%, PSRN by ~1.5%. Doesn't seem to be any change in speed. Change-Id: I1f68fa3c4f62dab3b90cc58041f05ebb048ae5ac	2015-02-20 08:47:59 -08:00
Jingning Han	6f4245894a	Add high bit depth support to rtc sub8x8 block coding This commit adds proper buffer handle to support high bit depth in rtc sub8x8 block coding. Change-Id: Ibaf8a2160194121aec9ca68b8094817fed9ccaea	2015-02-20 08:36:33 -08:00
Hangyu Kuang	a28a8cb726	Merge "Optimize the dequantization process on decoder side."	2015-02-20 08:23:54 -08:00
Adrian Grange	f03627347e	Merge "Fix control string in firstpass stats fprintf"	2015-02-19 16:36:43 -08:00
Hangyu Kuang	bdd249be31	Optimize the dequantization process on decoder side. Change-Id: I00621ff7165bbe86a18794b4a816976c9effaf78	2015-02-19 15:43:15 -08:00
Yunqing Wang	5e57729601	Merge "Improve skip_txfm thresholds in the non-rd mode selection"	2015-02-19 15:31:02 -08:00
Adrian Grange	2ae314fe3a	Fix control string in firstpass stats fprintf 20 items in the control string but only 19 arguments. Change-Id: I51dab9aa1c58c653b52395005a9cb41f09feb484	2015-02-19 15:18:30 -08:00
Jingning Han	216b171d63	Merge "Integral projection based motion estimation"	2015-02-19 15:08:11 -08:00
Yunqing Wang	81fc5bf81c	Improve skip_txfm thresholds in the non-rd mode selection Modified the thresholds of deciding whether or not to skip the transforms in model_rd_for_sb_y(). Used zbin[] instead of dequant[] to be more precise. Also, modified the checking coditions. Rtc set borg test results (at speed 6) showed: average PSNR gain: 0.138%, overall PSNR gain: 0.158%, and SSIM gain: 0.177%. The data rate test was modified slightly as suggested by Marco. Change-Id: Ieaf633ab77f4838cb3c45cf69065b29d55f8ae6c	2015-02-19 14:30:46 -08:00
Jingning Han	ed2dc59c1b	Integral projection based motion estimation This commit introduces a new block match motion estimation using integral projection measurement. The 2-D block and the nearby region is projected onto the horizontal and vertical 1-D vectors, respectively. It then runs vector match, instead of block match, over the two separate 1-D vectors to locate the motion compensated reference block. This process is run per 64x64 block to align the reference before choosing partitioning in speed 6. The overall CPU cycle cost due to this additional 64x64 block match (SSE2 version) takes around 2% at low bit-rate rtc speed 6. When strong motion activities exist in the video sequence, it substantially improves the partition selection accuracy, thereby achieving better compression performance and lower CPU cycles. The experiments were tested in RTC speed -6 setting: cloud 1080p 500 kbps 17006 b/f, 37.086 dB, 5386 ms -> 16669 b/f, 37.970 dB, 5085 ms (>0.9dB gain and 6% faster) pedestrian_area 1080p 500 kbps 53537 b/f, 36.771 dB, 18706 ms -> 51897 b/f, 36.792 dB, 18585 ms (4% bit-rate savings) blue_sky 1080p 500 kbps 70214 b/f, 33.600 dB, 13979 ms -> 53885 b/f, 33.645 dB, 10878 ms (30% bit-rate savings, 25% faster) jimred 400 kbps 13380 b/f, 36.014 dB, 5723 ms -> 13377 b/f, 36.087 dB, 5831 ms (2% bit-rate savings, 2% slower) Change-Id: Iffdb6ea5b16b77016bfa3dd3904d284168ae649c	2015-02-19 13:47:19 -08:00
Jingning Han	83559e7357	Fix a check condition in nonrd_pick_partition Change-Id: Ic92fb4b16948f745c218351b24fdafecf9abce3a	2015-02-19 09:54:55 -08:00
hkuang	02bd4edc2a	Merge "Fix the frame parallel invalid file test failure on ARM."	2015-02-18 14:09:28 -08:00

... 4 5 6 7 8 ...

13025 Commits