openh264

Author	SHA1	Message	Date
volvet	b3fa8dd334	Merge pull request #418 from mstorsjo/ios-neon-detection Use the __ARM_NEON__ built-in compiler define for identifying neon capability on iOS	2014-03-07 09:15:17 +08:00
Ethan Hugg	2f2801dc78	Merge pull request #434 from mstorsjo/threadlib-core-count-android Use the cpu-features NDK library for detecting the number of cores in WelsThreadLib	2014-03-06 08:02:28 -08:00
Ethan Hugg	bd8b6af2b8	Merge pull request #436 from mstorsjo/fix-endif-comment Correct the endif comment	2014-03-06 08:01:28 -08:00
Martin Storsjö	11bdebb12c	Explicitly enable the UAL syntax when using gnu tools Arm assembly has got two variants of the syntax, the old legacy syntax, and the new modern UAL (unified assembly language) syntax. Most arm assembly is the same in the both syntaxes, but some uncommon cases change the order of suffixes - the "subscs" instruction would be written "subcss" in the old syntax. The apple tools default to UAL, while the GNU tools (e.g. in android) require you to specify ".syntax unified" to enable the new syntax. When enabling the new syntax with the GNU tools, some cases of "sub r0, r1, lsl #1" needs to be written explicitly as "sub r0, r0, r1, lsl #1", handled in the previous commit. This allows using the same, modern syntax for things like subscs, without needing to have two alternate forms of writing it.	2014-03-06 16:21:54 +02:00
Martin Storsjö	c0043f7053	Use the three-operand form of add/sub with shift When using unified syntax, the two operand form with a shift isn't allowed.	2014-03-06 16:21:54 +02:00
Martin Storsjö	f1502c26e3	Don't use WELS_ASM_FUNC_END in the middle of a function WELS_ASM_FUNC_END declares the end of the function, and needs to be paired with WELS_ASM_FUNC_BEGIN.	2014-03-06 16:21:54 +02:00
Martin Storsjö	8ba79262bf	Rename a function to avoid conflicts between almost duplicate neon functions There's a different version of the same function in the encoder, but they're not identical - the encoder version has got stricter alignment requirements. If someone can confirm that it is ok to use the function from the encoder, pixel_sad_neon.S in processing could be deleted, and the encoder version moved to codec/common instead.	2014-03-06 16:19:48 +02:00
Martin Storsjö	4e4bfcc1bc	Regenerate makefiles to include the encoder arm assembly	2014-03-06 16:11:54 +02:00
Martin Storsjö	45e059ec5f	Rename expand_picture.S to expand_picture_neon.S This avoids ambiguity in the make based build system about whether expand_picture.o should be built from expand_picture.S or expand_picture.asm.	2014-03-06 16:11:40 +02:00
Martin Storsjö	ce4fa9e272	Correct the endif comment The code block is about HAVE_NEON, not X86_ASM.	2014-03-06 15:43:04 +02:00
Martin Storsjö	636df2bebb	Use WelsMultipleEventsWaitSingleBlocking within the worker thread on unix as well This avoids using a separate thread for handling pUpdateMbListEvent events, and later allowing using the encode exit event on unix instead of pthread cancellation.	2014-03-06 15:34:35 +02:00
Martin Storsjö	801da26d1d	Use WelsMultipleEventsWaitSingleBlocking with a master event for waiting on finished threads This allows using the same codepath for both unix and windows for distributing new slices to code to threads. This also improves the performance on unix - instead of waiting for all the current threads to finish their current slice before handing out a new slice to each of them (where the threads that finish first will just wait instead of immediately getting a new slice to work on), we now use the same logic as on windows. In one setup, it improves the performance of encoding from ~920 fps to ~950 fps, and in another setup it goes from ~390 fps to ~660 fps. (These tests were done with the SM_ROWMB_SLICE mode, which heavily exercises the code for distributing new slices to the worker threads.) The extra WelsEventSignal call on windows where it isn't strictly necessary doesn't incur any measurable slowdown, so it is kept without any extra ifdefs to keep the code more readable and unified.	2014-03-06 15:33:37 +02:00
Martin Storsjö	276b585f03	Use the cpu-features NDK library for detecting the number of cores in WelsThreadLib On arm, the exact same detection is done in WelsCPUFeatureDetect, but in the x86 version of that function we use x86 cpuid for getting the core count, and this is not available on all processors. For the case when cpuid can't tell the core count, use the NDK function as higher level API. The thread lib itself doesn't build properly on android yet, but will do so soon.	2014-03-06 15:28:59 +02:00
Martin Storsjö	d0a81355b0	Add support for using a separate "master event" in WelsMultipleEventsWait*Blocking This allows making the WelsMultipleEventsWaitSingleBlocking function work properly in unix, without polling. If a master event is provided, the function first waits for a signal on that event - once such a signal is received, it is assumed that one of the individual events in the list have been signalled as well. Then the function can proceed to check each of the semaphores in the list using sem_trywait to find the first one of them that has been signalled. Assuming that the master event is signalled in pair with the other events, one of the sem_trywait calls should succeed. The same master event is also used in WelsMultipleEventsWaitAllBlocking, to keep the semaphore values in sync across calls to the both functions.	2014-03-06 15:03:59 +02:00
Martin Storsjö	de32455d87	Remove the timeout parameter from WelsMultipleEventsWaitSingleBlocking All users of the function passed the value corresponding to "infinite", and the (currently unused) unix implementation of it only supported infinite wait as well.	2014-03-06 15:03:59 +02:00
Licai Guo	201ab42d7e	Merge pull request #431 from huili2/large_to_small_sps_bug Large to small sps bug for issue #373	2014-03-06 16:51:59 +08:00
volvet	8cc332dea1	Merge pull request #432 from zhilwang/arm-asm Arm asm	2014-03-06 16:50:56 +08:00
volvet	73452e0993	Merge pull request #429 from mstorsjo/simplify-ifdef-with-macro Use a macro for conditionally logging based on ENABLE_TRACE_MT	2014-03-06 16:01:41 +08:00
Licai Guo	8a7a9195d9	remove unused function	2014-03-05 23:14:57 -08:00
Licai Guo	4260a9b2ba	remove CS and RS syntaxs for issue 373	2014-03-05 22:59:27 -08:00
Licai Guo	7bfe801874	Remove trailing space	2014-03-06 14:55:36 +08:00
Licai Guo	67534b0fc0	arm asm code refine.	2014-03-06 14:30:16 +08:00
Martin Storsjö	30fff7ece0	Allow building plain armeabi binaries for android Building with "make OS=android APP_ABI=armeabi" will produce arm binaries that will conform to the armeabi ABI - not using any features outside of armv5te by default (but still optionally using the NEON functions at runtime if detected - even though such devices should rather use the default armeabi-v7a build).	2014-03-06 08:15:55 +02:00
Licai Guo	36ae8d9f6c	use stlport to replace libgnuc++, this remove GCCVERSION variable	2014-03-05 22:15:36 -08:00
Martin Storsjö	fd6f8a83b3	Use a macro for conditionally logging based on ENABLE_TRACE_MT This avoids having an extra ifdef around every single WelsLog call.	2014-03-06 08:06:34 +02:00
ruil2	28a56a6752	Merge pull request #415 from volvet/remove-useless-mgs-code remove un-supported mgs code	2014-03-06 14:05:04 +08:00
ruil2	8313e015a8	Merge pull request #427 from volvet/clean-encode-cfg clean encode cfg files	2014-03-06 14:04:48 +08:00
ruil2	a59c8ea04c	Merge pull request #428 from sijchen/read_para3 [Encoder Console] add new para reading to get accord with the new API design	2014-03-06 14:04:33 +08:00
volvet	54c63e2547	Merge pull request #423 from licaiguo/refine-android-build-pr rebase on latest code, refine android build	2014-03-06 13:55:16 +08:00
volvet	a17dd825a5	clean encode cfg files	2014-03-06 13:48:31 +08:00
sijchen	a4a8eddb04	add new para reading to get accord with the new API design	2014-03-06 13:48:18 +08:00
volvet	d292179095	Merge pull request #424 from ruil2/encoder_interface remove inter-deblock related parameters--review request #148	2014-03-06 13:39:57 +08:00
volvet	50fe120a3e	simplify-layer-process	2014-03-06 11:19:33 +08:00
ruil2	f176bf5637	modify welsenc.cfg for parameters update	2014-03-06 10:28:36 +08:00
ruil2	334c5765c7	remove inter-deblock related parameters	2014-03-06 10:26:53 +08:00
Licai Guo	585c526b1f	rebase on latest code, refine android build	2014-03-05 17:32:03 -08:00
volvet	8beb3c8c09	Merge pull request #417 from mstorsjo/unify-event-init Unify the interface for creating/deleting event objects	2014-03-06 09:13:13 +08:00
Licai Guo	61dae45fad	Merge pull request #421 from mstorsjo/android-x86-build Fix building android on x86	2014-03-06 09:07:26 +08:00
Ethan Hugg	4f535e31e6	Merge pull request #420 from mstorsjo/simplify-x86-asm-flags Don't add -DNO_DYNAMIC_VP to ASMFLAGS	2014-03-05 07:11:41 -08:00
Ethan Hugg	4b97d2d48f	Merge pull request #414 from mstorsjo/unix-newlines Convert encoder config files to unix newlines	2014-03-05 07:08:48 -08:00
volvet	97376c6339	Merge pull request #413 from mstorsjo/remove-commented-code Remove commented out, unused code	2014-03-05 22:13:35 +08:00
Martin Storsjö	506826a8ae	Fix building android on x86 In `70360cb11`, the ASM variable was moved to the x86-common file even though the android build file didn't include neither platform-arch.mk nor platform-x86-common.mk. Instead of explicitly declaring ASM here, include platform-arch.mk and remove setting of the flags that platform-arch.mk sets. This is inspired by and based on a patch by Licai Guo.	2014-03-05 15:19:18 +02:00
Martin Storsjö	0df3a068ba	Don't add -DNO_DYNAMIC_VP to ASMFLAGS None of the assembly source uses this define for anything.	2014-03-05 15:02:53 +02:00
volvet	7ea70491c8	Merge pull request #411 from mstorsjo/arm-add-func-markers Add .func/.endfunc markers in the arm assembly	2014-03-05 17:40:18 +08:00
Martin Storsjö	f384dde881	Add .func/.endfunc markers in the arm assembly This adds information to debug builds. This requires adding a separate definition of WELS_ASM_FUNC_END for apple tools.	2014-03-05 11:25:51 +02:00
Licai Guo	eb3dd8f0ae	Merge pull request #416 from huili2/move_iTotalNumMbRec_to_pCtx move iTotalNumMbRec from refpic to ctx	2014-03-05 17:12:43 +08:00
Licai Guo	e7cc8c2780	Add arm asm code for processing.	2014-03-05 16:54:05 +08:00
Martin Storsjö	ef7e05d47d	Use the __ARM_NEON__ built-in compiler define for identifying neon capability on iOS This avoids having to hardcode the names of devices that don't support neon. The devices that don't support neon don't run the armv7 variants of iOS binaries at all - they would need to be built for the armv6 architecture. (Building for armv6 isn't supported at all in modern iOS SDKs.) Therefore we can simply use the __ARM_NEON__ built-in compiler define to check if NEON code is allowed in the current build, and have the WelsCPUFeatureDetect function return flags accordingly. The only thing this disallows is doing an armv6 build which would optionally enable neon code at runtime if run on an armv7 capable device, but since Apple allows you to build the same binary for armv7 separately in the same app bundle, and since armv6 building isn't even possible in the current iOS SDKs, this isn't really a loss. This is in contrast to the android builds where the armv7 baseline does not include NEON.	2014-03-05 09:47:05 +02:00
Martin Storsjö	d4bdef2916	Use an event name that contains the process id This reduces the risk for namespace collisions if two processes run the encoder simultaneously without address space layout randomization.	2014-03-05 09:36:46 +02:00
Martin Storsjö	4814d5828d	Use unnamed semaphores on linux This avoids the risk of namespace collisions for named semaphores (where the names are global for the whole machine), on platforms where we strictly don't need to use the named semaphores.	2014-03-05 09:36:46 +02:00

... 9 10 11 12 13 ...

1478 Commits