generic-library/vpx

Author	SHA1	Message	Date
hui su	be407e001b	Adjust optimize_b RD parameters Coding gain lowres 0.51% midres 0.36% Change-Id: I1e9f2f9341bad12d9023f97c73d0e991ae5ec7f0	2016-05-06 09:56:59 -07:00
hui su	5af706210b	Enable optimize_b for intra blocks Coding gain lowres 0.13% midres 0.36% Change-Id: Ia65d2d7513685dfcffbe3939fea473fb6d7036b9	2016-05-06 09:55:45 -07:00
Debargha Mukherjee	7a2934550b	Merge "Quantization Profiles Strictly on Entropy Context" into nextgen	2016-05-02 19:15:13 +00:00
Brandon Young	43195061b7	Quantization Profiles Strictly on Entropy Context Allow for 3 quant profiles from entropy context Refactored dq_offset bands to allow for re-optimization based on number of quantization profiles Change-Id: Ib8d7e8854ad4e0bf8745038df28833d91efcfbea	2016-05-01 12:25:57 -07:00
Sarah Parker	b8bf8085be	Merge "Search parameter space around gm parameters" into nextgen	2016-04-26 22:08:20 +00:00
Sarah Parker	ba6423c564	Make feature based motion estimation the default Optical flow parameters still need to be tweaked and is much slower so feature based should be the default for now. Change-Id: Id6cafb5a245e329f728e9c66c89c0ed1018c347c	2016-04-21 11:50:05 -07:00
Sarah Parker	7fa7f4c14a	Fix comipler warnings in opticalflow.c Change-Id: I4561c2676d8a7793ade47e1995e026ba9d521fdd	2016-04-20 17:27:13 -07:00
Sarah Parker	de085787ec	Search parameter space around gm parameters For each global motion parameter, search some step size to the left and right to try and minimize warp mse and correct for error. Change-Id: I1f0b464b0924d56b15460e509f89736af9b7e78f	2016-04-13 17:56:25 -07:00
Brandon Young	50619bacfd	Fix error with cumbins to allow multiple profiles Change-Id: I23aadc8f7551771197b55465a3264250b40838ff	2016-04-08 13:16:34 -07:00
Debargha Mukherjee	a399281daa	Merge "Framework to incorporate switchable dequant levels" into nextgen	2016-03-14 16:34:04 +00:00
Debargha Mukherjee	b8a7d1fe02	Framework to incorporate switchable dequant levels Changed experiment to allow switchable QUANT_PROFILES Change-Id: I8e5e76239a4103273e9ef759d46400104ef55599	2016-03-11 23:07:26 -08:00
Yunqing Wang	efb93f518f	Merge "Implement a tile copying method in large-scale tile coding" into nextgen	2016-02-05 23:10:08 +00:00
Yunqing Wang	6e4dff92a4	Implement a tile copying method in large-scale tile coding A tile copy mode is introduced, while allows a tile to use another tile's coded data directly at bitstream level. This largely reduces the bit rate in this use case. Our tests showed that 10% - 20% bit rate reduction was achieved. Change-Id: Icf5ae00320e27193b15ce95297720f8b6f5e7fd9	2016-02-01 11:08:30 -08:00
Sarah Parker	36cbc27587	Merge "Optical flow for computing global motion params" into nextgen	2016-01-25 22:06:32 +00:00
Sarah Parker	a7aab42915	Optical flow for computing global motion params Addressing comments, fixing style issues Change-Id: I91b72ab5cdf80d68476858f442616ab3af41e709	2016-01-22 11:29:33 -08:00
Debargha Mukherjee	b765dc8e39	Merge "Changes to CONFIG_NEW_QUANT experiment." into nextgen	2016-01-22 00:15:05 +00:00
Brandon Young	9c9beb1c40	Changes to CONFIG_NEW_QUANT experiment. Added dq_off_index attribute to mbmi to allow for switching between dequantization modes. Reduced number of different dequantization modes from 5 to 3. Changed dequant_val_nuq to be allow for 3 dequant levels instead of 1. Fixed lint errors Change-Id: I7aee3548011aa4eee18adb09d835051c3108d2ee	2016-01-19 15:59:50 -08:00
Debargha Mukherjee	c20adb59b2	Merge "Rework wedge experiment to improve speed." into nextgen	2016-01-14 16:13:38 +00:00
Brandon Young	839ab586f6	Merge "Simplify new_quant experiment" into nextgen	2016-01-14 00:51:56 +00:00
Debargha Mukherjee	f93bdebcf9	Rework wedge experiment to improve speed. Implements wedge mask generation by pre-computing arrays. Improves encode speed by 15-20%. Also consolidates the mask generation code for inter-inter and inter-intra. Change-Id: If1d4ae2babb2c05bd50cc30affce22785fd59f95	2016-01-12 15:23:54 -08:00
Yunqing Wang	15a90668a5	Adaptively determine the number of bytes used for tile-data size transmission In large-scale tile coding, when the number of tiles is large and the tile size is small, using a fixed number of bytes in the tile header to store tile-data size information as done in current VP9 codec would bring high overhead for each tile. This patch implemented 2 ways to lower that overhead and adaptively determine the number of bytes needed for tile-data size transmission. The test on a test clip having the tile size of 64x64 showed that the number of bytes used for storing tile-data size was reduced from 4 to 1, which substantially improved the compression ratio in large-scale tile coding. Change-Id: Ia02fc43eda67fa252fbc2554321957790f53f6fd	2016-01-11 12:46:04 -08:00
Brandon Young	604b909e06	Simplify new_quant experiment Changes number of non-uniform bins from 5 to 3. Changed all references to 'KNOTES' to 'KNOTS'. Change-Id: If39d7cef00dbe07307035e44cb29194d0783b475	2015-12-17 12:25:27 -08:00
Debargha Mukherjee	e6790e30c5	Replace DST1 with DST2 for ext-tx experiment A small gain (0.1 - 0.2%) with this experiment on derflr/hevcmr. The DST2 can be implemened very efficiently using sign flipping of odd indexed inputs, followed by DCT, followed by reversal of the output. This is how it is implemented in this patch. SIMD optimization is pending. Change-Id: Ic2fc211ce0e6b7c6702974d76d6573f55cc4da0e	2015-12-14 13:54:41 -08:00
Julia Robson	711600e5f1	SSSE3 optimisations of high bit depth masked variance functions Includes tests which compare output of new SSSE3 functions with their C equivalents, and fixes to the C code to ensure these tests pass. Change-Id: Iec3980cce95a8ee6bf9421fa4793130e92c162e3	2015-12-04 11:59:30 -08:00
Julia Robson	997595e8a6	Code refactoring to remove duplication in masked sub pixel var fns Change-Id: Idcc93ff94ed2c68367225b6bebfcae0b7952f074	2015-12-03 17:57:35 +00:00
Julia Robson	ea167a5855	Adding SSSE3 accelerations of masked SAD functions Includes tests of masked SAD function optimisations against C versions Change-Id: I42f198767a113b58ae9456841f4ec71075591720	2015-12-01 09:55:24 -08:00
Julia Robson	8e4d779137	SSSE3 optimisations of masked variance function (8bit ONLY) Includes test which compares output of new SSSE3 functions with their C equivalents Change-Id: I4488cd7672cdb57efff93c0b3b8bff07f07ec544	2015-12-01 12:07:22 +00:00
Julia Robson	ef01ea152d	Changes so other expts work with 128x128 coding unit size expt Changes ensure wedge_partition, interintra and palette expts all work alongside the ext_coding_unit_size experiment. Change-Id: I18f17acb29071f6fc6784e815661c73cc21144d6	2015-11-26 17:43:00 +00:00
Julia Robson	d52cc33891	Changes to allow 128x128 to work with copy_mode Change-Id: I1fff46e7733e5e1d15c012d6a204536243c6e090	2015-11-24 10:15:10 -08:00
Debargha Mukherjee	0c29af2767	Merge last4_ref with multi-ref experiment Change-Id: I9778670ff6d02fb021099b17f292edadf87603e6	2015-11-13 14:54:38 -08:00
Debargha Mukherjee	ec6d31781d	Merge "Remove dst1 config option and merge with ext-tx" into nextgen	2015-11-13 21:01:21 +00:00
Debargha Mukherjee	3a45a1edfd	Remove dst1 config option and merge with ext-tx Change-Id: I0152ed352ae2a0a725a508b5c209ef2c1dc2302d	2015-11-13 11:24:38 -08:00
Julia Robson	3d9133b2a5	SSE2 optim of vp9_subtract_block for 128x128 units Extending the SSE2 implementation of vp9_subtract_block to work with the 128x128 coding unit experiment Change-Id: Ib3cc16bf5801ef2c7eecc19d3cc07a8c50631580	2015-11-13 11:12:56 -08:00
Debargha Mukherjee	3436acb347	Merge "Adding encoder support for 128x128 coding units" into nextgen	2015-11-13 18:52:09 +00:00
Debargha Mukherjee	a542190830	Merge "Eliminate copying for FLIPADST in fwd transforms." into nextgen	2015-11-13 18:23:49 +00:00
Geza Lore	177ad11981	Eliminate copying for FLIPADST in fwd transforms. This is a port of 01bb4a318dc0f9069264b7fd5641bc3014f47f32 This commit also fixes a bug where FLIPADST transforms when combined with a DST (that is FLIPADST_DST and DST_FLIPADST) did not actually did a flipped transform but a straight ADST instead. This was due to the C implementation that it fell back on not implementing flipping. This is now fixed as well and FLIPADST_DST and DST_FLIPADST does what it is supposed to do. Change-Id: I89c67ca1d5e06808a1567c51e7d6bec4998182bd	2015-11-13 09:34:26 -08:00
Debargha Mukherjee	59de0c0bc7	Adding encoder support for 128x128 coding units Changes to allow the encoder to make use of 128x128 coding units. Change-Id: I340bd38f9d9750cb6346d83885efb00443852910	2015-11-13 09:21:22 -08:00
Debargha Mukherjee	9d9962aec9	Some fixes on context size for the 128x128 expt Change-Id: I56f050502e3a750ce74b196d033b780218df2c1f	2015-11-13 07:06:19 -08:00
Julia Robson	d90a3265f0	Changes to use defined constants rather than hard-coded numbers Also fixes a valgrind error when optimizations are disabled. Done in preparation for the work on the extended coding unit size experiment. Change-Id: Ib074c5a02c94ebed7dd61ff0465d26fa89834545	2015-11-12 15:42:32 -08:00
Julia Robson	598a11d04a	Fixing issue with calculation of block_idx For tall rectangular blocks, the block_idx of the lower transform block was being mis-calculated. Does not affect results the way this function is being used now. Change-Id: I470464d19be0bf0f42003d0cc29793bc42db8f52	2015-11-12 08:21:53 -08:00
Debargha Mukherjee	9fc51184b7	Merge "Speed up of wedge search" into nextgen	2015-10-06 00:33:50 +00:00
Debargha Mukherjee	597204298a	Speed up of wedge search Speeds up wedge search by pre-calculating single predictions before computing the wedge combination. About 20% speed up achieved. Change-Id: I72f76568559d1899c8ac0afaa133d433ba388e6d	2015-10-04 23:43:25 -07:00
Debargha Mukherjee	ff9aa146cb	Reimplementatio of dst1 for speed Encoder with --enable-ext-tx --enable-dst1 is now 4 times faster. Change-Id: Ia750ad3516698ce94da4ceb566b1c51539537a95	2015-10-02 11:06:55 -07:00
Zoe Liu	90dede3f5b	Merge "Merged LAST2 and LAST3 to one experiment MULTI_REF" into nextgen	2015-09-25 16:47:28 +00:00
Zoe Liu	e8e58402f0	Merged LAST2 and LAST3 to one experiment MULTI_REF Change-Id: I220be17af317520dccb62fa6b19da5c7ce10652d	2015-09-24 17:08:15 -07:00
Zoe Liu	a9900eb2b1	Merge "Added another reference frame LAST4_FRAME" into nextgen	2015-09-24 15:58:22 +00:00
Zoe Liu	829dbf7a79	Added another reference frame LAST4_FRAME Under the experiment of CONFIG_LAST4_REF. On derflr testset, using highbitdepth (HBD), in average PSNR, (1) LAST2+LAST3+LAST4 obtained +0.361% against LAST2+LAST3; (2) LAST2+LAST3+LAST4 obtained +1.567% against baesline. Change-Id: Ic8b14272de6a569df2b54418fa72b505e1ed3aad	2015-09-23 17:10:44 -07:00
hui su	71d0af90f8	Adjust rd calculation in choose_tx_size_from_rd Change-Id: I3649f28196a87663b116b9fe6446b1fbe6eeab4a	2015-09-23 14:37:45 -07:00
Zoe Liu	9144967ca8	Fixed a couple of bugs for LAST3 Change-Id: I63126a844c255df4a447aac7f630ba54cc7d7d7a	2015-09-21 11:50:35 -07:00
Zoe Liu	c0889b9a8c	Added a 3rd reference frame LAST3_FRAME Under experiment CONFIG_LAST3_REF, which can only be turned on when the experiment of CONFIG_MULTI_REF is on, i.e. LAST3_FRAME can only be used when LAST2_FRAME is used. CONFIG_LAST3_REF would most likely be combined with CONFIG_MULTI_REF once the performance improvement is further confirmed. On the testset of derflr, using Average PSNR metrics, with HighBitDepth (HBD) on: (1) LAST2 HBD obtained +0.579% against base HBD; (2) LAST2 + LAST3 HBD obtained +0.591% against LAST2 HBD; (3) LAST2 + LAST3 HBD obtained +1.173% against base HBD. Change-Id: I1aa2b2e2d2c9834e5f8e61bf2d8818c7b1516669	2015-09-18 15:25:46 -07:00

1 2 3 4 5 ...

4774 Commits