generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	95390b2b20	Moving ymode_count, uv_mode_count to macroblock struct Change-Id: Ib73c7b2bee4cb2eb2528fa6b381fffe9503079a0	2012-11-05 12:25:18 -08:00
Scott LaVarnway	03c0af8747	Moved skip_true_count to macroblock struct Change-Id: Ie9a26be7c9baa54a0e43a63ed6c77f2746477a9c	2012-11-05 11:02:35 -08:00
Scott LaVarnway	7ee44eef13	Moving coef_counts to macroblock struct Change-Id: I289564a5a27f0d03ddc6f19c7838542ff22719be	2012-11-05 11:00:49 -08:00
Scott LaVarnway	9016fa22e0	Removed unnecessary VP8_COMP *cpi parameters Code cleanup Change-Id: I82f9d787a2f511d39895fd8dfd5347a1676d9dbc	2012-11-02 12:10:07 -07:00
Scott LaVarnway	c533f2a43e	Merge "Added error checking to vp8cx_create_encoder_threads()"	2012-08-23 13:55:06 -07:00
Yunqing Wang	8ee88da060	Fix inter_zz_count calculation bug The current way of counting inter_zz_count doesn't work correctly in multi-threaded encoding. Calculating it after the frame is encoded fixed the problem. Change-Id: Ifcb1972cde950b8cc194f75c6d7b6af09e8b0e65	2012-08-20 17:43:06 -07:00
Scott LaVarnway	d1b6fa4161	Added error checking to vp8cx_create_encoder_threads() Added checks for pthread_create() errors. Change-Id: Ie198ef5c14314fe252d2e02f7fe5bfacc7e16377	2012-08-14 12:00:23 -07:00
Attila Nagy	03b3fcecac	Fix potential encoder dead-lock after picture resize The sync interval for the multithreaded encoder was considered as not changing during the encoding. This is not true if picture size is changed. The encoder could dead-lock because the main thread and the other threads were using different sync interval. Change-Id: I75232bbdbc6c02d77f830d870fd8b4e96697c64e	2012-08-01 08:28:54 +03:00
Attila Nagy	e66e9ddfb4	Optimizes updates of encoder block ptrs Precalculated block ptrs do not need updates during encoding. Set these at init stage. Moved the allocation of 'mt_current_mb_col' (last encoded MB on each row) to vp8_alloc_compressor_data(), so that it is correctly reallocated when frame size is changing. Change-Id: Idcdaa2d0cf3a7f782b7d888626b7cf22a4ffb5c1	2012-07-27 09:59:09 +03:00
John Koleszar	0164a1cc5b	Fix pedantic compiler warnings Allows building the library with the gcc -pedantic option, for improved portabilty. In particular, this commit removes usage of C99/C++ style single-line comments and dynamic struct initializers. This is a continuation of the work done in commit `97b766a46`, which removed most of these warnings for decode only builds. Change-Id: Id453d9c1d9f44cc0381b10c3869fabb0184d5966	2012-06-11 15:14:58 -07:00
John Koleszar	44d35f7b25	Merge branch 'origin/eider' into master Conflicts: vp8/common/entropymode.c vp8/common/entropymode.h vp8/encoder/encodeframe.c vp8/vp8_cx_iface.c Change-Id: I708b0f30449b9502b382e47b745d56f5ed2ce265	2012-05-11 10:51:05 -07:00
John Koleszar	22f56b93e5	Formalize encodeframe.c forward delclarations Change If4321cc5 fixed a bug caused by forward declarations not being kept in sync across C files, resulting in a function call with the wrong arguments. The commit moves the affected function declarations into a header file, along with the other symbols from encodeframe.c that were being sloppily shared. Change-Id: I76a7b4c66d4fe175f9cbef7e52148655e4bb9ba1	2012-05-04 10:44:47 -07:00
Attila Nagy	3e32105d63	Fix multi-resolution threaded encoding mb_row and mb_col was not passed to vp8cx_encode_inter_macroblock in threaded encoding. Change-Id: If4321cc59bf91e991aa31e772f882ed5f2bbb201	2012-05-04 10:44:46 -07:00
Attila Nagy	357800e7cd	Fix multi-resolution threaded encoding mb_row and mb_col was not passed to vp8cx_encode_inter_macroblock in threaded encoding. Change-Id: If4321cc59bf91e991aa31e772f882ed5f2bbb201	2012-05-04 13:32:43 +03:00
Attila Nagy	b41c17d625	Shares one set of RD costs tables between all encoding threads RD costs were local to MACROBLOCK data and had to be copied all the time to each thread's MACROBLOCK data. Tables moved to a common place and only pointers are setup for each encoding thread. vp8_cost_tokens() generates 'int' costs so changed all types to be int (i.e. removed unsigned). NOTE: Could do some more cleaning in vp8cx_init_mbrthread_data(). Change-Id: Ifa4de4c6286dffaca7ed3082041fe5af1345ddc0	2012-04-23 14:15:23 -04:00
Attila Nagy	52cf4dcaea	Packing bitstream on-the-fly with delayed context updates Produce the token partitions on-the-fly, while processing each MB. Context is updated at the beginning of each frame based on the previoud frame's counters. Optimally encoder outputs partitions in separate buffers. For frame based output, partitions are concatenated internally. Limitations: - enabled just in combination with realtime-only mode - number of encoding threads has to be equal or less than the number of token partitions. For this reason, by default the encoder will do 8 token partitions. - vpxenc supports partition output (-P) just in combination with IVF output format (--ivf) Performance: - Realtime encoder can be up to 13% faster (ARM) depending on the number of threads and bitrate settings. Constant gain over the 5-16 speed range. - Token buffer reduced from one frame to 8 MBs Quality: - quality is affected by the delayed context updates. This again dependents on input material, speed and bitrate settings. For VC style input the loss seen is up to 0.2dB. If error-resilient=2 mode is used than the effect of this change is negligible. Example: ./configure --enable-realtime-only --enable-onthefly-bitpacking ./vpxenc --rt --end-usage=1 --fps=30000/1000 -w 640 -h 480 --target-bitrate=1000 --token-parts=3 --static-thresh=2000 --ivf -P -t 4 -o strm.ivf tanya_640x480.yuv Change-Id: I127295cb85b835fc287e1c0201a67e378d025d76	2012-02-29 12:13:37 -05:00
Attila Nagy	78071b3b97	Multithreaded encoder, late sync loopfilter Second shot at this... Sync with loopfilter thread as late as possible, usually just at the beginning of next frame encoding. This returns control to application faster and allows a better multicore scaling. When PSNR packets are generated the final filtered frame is needed imediatly so we cannot delay the sync. Same has to be done when internal frame is previewed. Change-Id: I64e110c8b224dd967faefffd9c93dd8dbad4a5b5	2012-02-16 12:26:39 +02:00
Scott LaVarnway	11c706488b	Removed frames_till_alt_ref_frame from MACROBLOCKD Change-Id: Ieb05270ac332a4cc38ec4b7b995fc0150e0fffdf	2012-02-02 13:34:13 -05:00
Scott LaVarnway	e2000cc5ca	Removed frames_since_golden from MACROBLOCKD Change-Id: I10efa441d663fceb6bc97a3bfad518cd3d9a5128	2012-02-02 13:28:41 -05:00
John Koleszar	109b69a706	RTCD: add arnr functions This commit continues the process of converting to the new RTCD system. It removes the last of the VP8_ENCODER_RTCD struct references. Change-Id: I2a44f52d7cccf5177e1ca98a028ead570d045395	2012-01-30 12:10:48 -08:00
John Koleszar	510e0ab467	RTCD: add FDCT functions This commit continues the process of converting to the new RTCD system. Change-Id: I3f9c07db65eb206f6363d21bdb80e871570da767	2012-01-30 12:10:42 -08:00
John Koleszar	fdb61a4531	RTCD: add recon functions This commit continues the process of converting to the new RTCD system. Change-Id: I9bfcf9bef65c3d4ba0fb9a3e1532bad1463a10d6	2012-01-30 12:08:28 -08:00
Scott LaVarnway	5f25d4c175	Reduced the size of Y1Dequant and friends to [128][2] This patch removes the local copies of the dequantize constants and implements John's idea as described in "Make a local copy of the dequantized data" commit. Change-Id: Ic6b7d681f00bf63263f71ff1e39ab2f80729e8b2	2012-01-06 11:12:00 -08:00
John Koleszar	3cb92b85b9	Remove unused MACROBLOCK member vector_range Change-Id: Ie2dc0d72363ff38e0f71b59f6e2d1a2d70c5266b	2011-12-28 14:58:38 -08:00
John Koleszar	31e86192ba	Remove unused BLOCK member force_empty Change-Id: I72ed49ce14ca0124dd0d31bfcf4c7630a4681587	2011-12-28 13:57:51 -08:00
John Koleszar	f56918ba9c	Remove legacy integer types Remove BOOL, INTn, UINTn, etc, in favor of C99-style fixed width types. Change-Id: I396636212fb5edd6b347d43cc940186d8cd1e7b5	2011-12-22 09:58:40 -08:00
Yunqing Wang	06fc0f83b6	Populate q_index in multi-thread encoding This value needs to be copied to each thread's data structure. This fixed artifact problem in multi-thread encoder. Change-Id: Iab6d9745a1d44846aa503184705376f63a505597	2011-11-28 15:58:28 -05:00
Yunqing Wang	4c14efd234	Fix checks in MB quantizer initialization vp8cx_mb_init_quantizer() needs to be called at least once to get all values calculated. This change added one check to decide if we could skip initialization or not. Change-Id: I3f65eb548be57580a61444328336bc18c25c085b	2011-11-08 12:11:48 -05:00
Johann	a82cc0205d	remove unused variable warning Change-Id: I4fcd6e4656d9823aead941616cd63501aecbd6e2	2011-10-24 16:33:45 -07:00
Scott LaVarnway	5bc7b3a68e	Fixed encoder crash caused by the "Removed bmi copy to/from BLOCKD" commit. Change-Id: I9fae71bdc34c8ecc07bb81cd3ccf498b91ce3ec7	2011-09-13 11:46:33 -04:00
Yunqing Wang	0d87098e08	Copy macroblock data to a buffer before encoding it I got this idea from Pascal (Thanks). Before encoding a macroblock, copy it to a 16x16 buffer, and then read source data from there instead. This will help keep the source data in cache, and help with the performance. Change-Id: Id05f4cb601299150511d59dcba0ae62c49b5b757	2011-06-23 13:54:02 -04:00
Paul Wilkins	4e81a68af7	Further activity masking changes: Some further re-structuring of activity masking code. Still has various experimental switches. Supports a metric based on intra encode. Experimental comparison against a fixed activity target rather than a frame average, for altering rd and zbin. Overall the SSIM performance is similar to TT's original code but there is a much smaller PSNR hit of circa 0.5% instead of 3.2% Change-Id: I0fd53b2dfb60620b3f74d7415e0b81c1ac58c39a	2011-06-08 16:03:37 +01:00
Johann	04edde2b11	Merge "neon fast quantize block pair"	2011-06-06 13:42:58 -07:00
Scott LaVarnway	773768ae27	Removed B_MODE_INFO Declared the bmi in BLOCKD as a union instead of B_MODE_INFO. Then removed B_MODE_INFO completely. Change-Id: Ieb7469899e265892c66f7aeac87b7f2bf38e7a67	2011-06-02 13:46:41 -04:00
Tero Rintaluoma	61f0c090df	neon fast quantize block pair vp8_fast_quantize_b_pair_neon function added to quantize two adjacent blocks at the same time to improve performance. - Additional 3-6% speedup compared to neon optimized fast quantizer (Tanya VGA@30fps, 1Mbps stream, cpu-used=-5..-16) Change-Id: I3fcbf141e5d05e9118c38ca37310458afbabaa4e	2011-06-01 10:48:05 +03:00
Scott LaVarnway	cfab2caee1	Removed unused variable warnings Change-Id: I6e5e921f03dc15a72da89a457848d519647677a3	2011-05-24 15:17:03 -04:00
Scott LaVarnway	e11f21af9a	MODE_INFO size reduction Declared the bmi in MODE_INFO as a union instead of B_MODE_INFO. This reduced the memory footprint by 518,400 bytes for 1080 resolutions. The decoder performance improved by ~4% for the clip used and the encoder showed very small improvements. (0.5%) This reduction was first mentioned to me by John K. and in a later discussion by Yaowu. This is WIP. Change-Id: I8e175fdbc46d28c35277302a04bee4540efc8d29	2011-05-24 13:24:52 -04:00
John Koleszar	048497720c	Remove unused members of VP8_COMP Various members that were either completely unreferenced or written and not read. Change-Id: Ie41ebac0ff0364a76f287586e4fe09a68907806e	2011-05-19 15:49:09 -04:00
Paul Wilkins	ff52bf3691	Restructure of activity masking code. This commit restructures the mb activity masking code to better facilitate experimentation using different metrics etc. and also allows for adjustment of the zero bin either for encode only or both the encode and mode selection stages It also uses information from the current frame rather than the previous frame and the default strength has been reduced. Change-Id: Id39b19eace37574dc429f25aae810c203709629b	2011-05-13 10:37:50 +01:00
Yaowu Xu	1bcf4e66bb	Merge "fix a bug related to gf_active_flags in multi-threaded encoder"	2011-05-10 19:59:52 -07:00
Yaowu Xu	89c6017cc0	fix a bug related to gf_active_flags in multi-threaded encoder Paul pointed out that the pointer to the gf_active_flags is not being properly incremented in multithreaded encoder. This commit fixes the issue by making sure the gf_active_ptr points to the starting of next group of mb rows. Change-Id: I3246e657d23beabb614dfb880733a68a5fd7e34c	2011-05-06 09:00:44 -07:00
Aron Rosenberg	eeb8117303	Fix semaphore emulation on Windows The existing emulation of posix semaphores on Windows uses SetEvent() and WaitForSingleObject(), which implements a binary semaphore, not a counting semaphore as implemented by posix. This causes deadlock when used with the expected posix semantics. Instead, this patch uses the CreateSemaphore() and ReleaseSemaphore() calls (introduced in Windows 2000) which have the expected behavior. This patch also reverts commit `eb16f00`, which split a semaphore that was being used with counting semantics into two binary semaphores. That commit is unnecessary with corrected emulation. Change-Id: If400771536a27af4b0c3a31aa4c4e9ced89ce6a0	2011-05-06 00:13:59 -04:00
Yunqing Wang	eb16f00cf2	Fix rare hang in multi-thread encoder on Windows This patch is to fix a rare hang in multi-thread encoder that was only seen on Windows. Thanks for John's help in debugging the problem. More test is needed. Change-Id: Idb11c6d344c2082362a032b34c5a602a1eea62fc	2011-05-05 10:42:29 -04:00
Yunqing Wang	aeb86d615c	Merge "Runtime detection of available processor cores."	2011-05-05 04:59:54 -07:00
Yunqing Wang	3d6815817c	Use full-pixel MV in mvsadcost calculation MV sad cost error is only used in full-pixel motion search, which only need full-pixel resolution instead of quarter-pixel resolution. This change reduced mvsadcost table size, and removed unneccessary pamameter passing since this table is constant once it is generated. Change-Id: I9f931e55f6abc3c99011321f1dfb2f3562e6f6b0	2011-04-01 16:41:58 -04:00
Attila Nagy	297b27655e	Runtime detection of available processor cores. Detect the number of available cores and limit the thread allocation accordingly. On decoder side limit the number of threads to the max number of token partition. Core detetction works on Windows and Posix platforms, which define _SC_NPROCESSORS_ONLN or _SC_NPROC_ONLN. Change-Id: I76cbe37c18d3b8035e508b7a1795577674efc078	2011-03-31 10:23:01 +03:00
Attila Nagy	bfe803bda3	Fix multithreaded encoding for 1 MB wide frame Thread synchronization was not correct when frame width was 1 MB. Number of allocated encoding threads is limited by the sync_range. There is no point having more because each thread lags sync_range MBs behind the thread processing the row above. http://code.google.com/p/webm/issues/detail?id=302 Change-Id: Icaf67a883beecc5ebf2f11e9be47b6997fdf6f26	2011-03-18 12:35:30 +02:00
Attila Nagy	3ae2465788	Encoder loopfilter running in its own thread In multithreaded mode the loopfilter is running in its own thread (filter level calculation and frame filtering). Filtering is mostly done in parallel with the bitstream packing. Before starting the packing the loopfilter level has to be calculated. Also any needed reference frame copying is done in the filter thread. Currently the encoder will create n+1 threads, where n > 1 is the number of threads specified by application and 1 is the extra filter thread. With n = 1 the encoder runs in single thread mode. There will never be more than n threads running concurrently. Change-Id: I4fb29b559a40275d6d3babb8727245c40fba931b	2011-03-11 10:52:51 +02:00
John Koleszar	02321de0f2	Fix relative include paths Allow compiling without adding vp8/{common,encoder,decoder} to the include paths. Change-Id: Ifeb5dac351cdfadcd659736f5158b315a0030b6c	2011-02-10 15:09:44 -05:00
Gaute Strokkenes	315e3c2518	Put more code under #if CONFIG_MULTITHREAD. Change-Id: Icf4b692099d7d249fe3553852b1022b027b28e4b	2011-02-09 11:21:18 -05:00

1 2

69 Commits