isa-l

mirror of https://github.com/intel/isa-l.git synced 2024-12-14 02:05:11 +01:00

Author	SHA1	Message	Date
Jerry Yu	183385f02f	multibinary: Add run-time cpu feature detect for aarch64 Some CPUs report an "illegal instruction" error for the crc test because they do not support the relevant optional feature. This can be fixed by introducing CPU feature detection for AArch64. The difference with the x86 implementation is the dispatcher. It is based on the glibc function `getauxval(AT_HWCAP)` and `getauxval(AT_HWCAP2)`, not registers or instructions. On a heterogeneous system (big.LITTLE), it is dangerous to detect CPU features using identification registers. And while it is possible to use architectural feature registers from userspace on recent kernels, this won't necessarily work with older platforms. Thus we use the HW_CAPs exported from the kernel (and visible in getauxval) as the solution. - According to kernel suggestion, getauxval should be used for this purpose. - [CPU Feature detection](https://github.com/torvalds/linux/blob/master/Documentation/arm64/cpu-feature-registers.rst) - According to AAPCS, result/parameter registers should be saved/restored for function call - [AAPCS](http://infocenter.arm.com/help/topic/com.arm.doc.ihi0055b/IHI0055B_aapcs64.pdf) - [GLibc](https://sourceware.org/git/gitweb.cgi?p=glibc.git;a=blob;f=sysdeps/aarch64/dl-trampoline.S) Signed-off-by: Jerry Yu <jerry.h.yu@arm.com> Change-Id: Ic9abe0d2268ac95537e1abf10acc642fc58a5054	2019-08-26 17:58:42 +08:00
Greg Tucker	4ac0e435eb	ec: Fix incorrect min size stated for gf_vect_mad Change-Id: If178913f01f0d500aa66ce0e8dd67aaba49a0871 Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2019-07-16 15:41:34 -07:00
Yibo Cai	57eed2f02b	aarch64: Cleanup build issues This patch addresses one build failure and fixes several build warnings for Arm (some for x86 too). - Fix dynamic relocation link failure of ld.bfd 2.30 on Arm [log] relocation R_AARCH64_ADR_PREL_PG_HI21 against symbol `xor_gen_neon' which may bind externally can not be used when making a shared object - Add arch dependent "other_tests" to exclude x86 specific tests on Arm [log] isa-l/erasure_code/gf_2vect_dot_prod_sse_test.c:181: undefined reference to `gf_2vect_dot_prod_sse' - Check "fread" return value to fix gcc warnings on Arm and x86 [log] warning: ignoring return value of ‘fread’, declared with attribute warn_unused_result [-Wunused-result] fread(in_buf, 1, in_size, in_file); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ - Fix issue of comparing "char" with "int" on Arm. "char" is unsigned on Arm by default, so an unsigned char will never equal EOF (-1). [log] programs/igzip_cli.c:318:31: warning: comparison is always true due to limited range of data type [-Wtype-limits] while (tmp != '\n' && tmp != EOF) ^~ - Include <stdlib.h> in several files to fix build warnings on Arm [log] igzip/igzip_inflate_perf.c:339:5: warning: incompatible implicit declaration of built-in function ‘exit’ exit(0); ^~~~ Change-Id: I82c1b63316b634b3d398ffba2ff815679d9051a8 Signed-off-by: Yibo Cai <yibo.cai@arm.com>	2019-03-20 10:15:40 +08:00
Greg Tucker	e08dfab9b3	test: Fix c99 warn in perf helper functions Change-Id: I7e116215dc95bbca96c7285b98f5b8ec4e340ef1 Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2019-03-18 15:07:47 -07:00
Roy Oursler	699bb5bd3f	all: Revamp performance testing to be time based Change-Id: I6260d28e4adc974d8db0a1c770e3eb922d87f8e4 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2019-03-07 09:28:04 -07:00
Roy Oursler	aae6e29d28	igzip: Remove unaligned stores Change-Id: I8d351c8b7153178d26d6fc702ee3036b71165b93 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2019-03-07 09:27:50 -07:00
Roy Oursler	a3169750b5	mem: Remove unaligned loads in base function Change-Id: I8fb0f2e2e372485c864d5c60f816b661a865b707 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2019-03-07 09:27:50 -07:00
Roy Oursler	5be1ba2215	igzip: Remove undefined unaligned loads Change-Id: I02591d958f8691d07b261218cf5ab361e8ad36c9 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2019-03-07 09:27:50 -07:00
Zhiyuan Zhu	636272cff6	aarch64: Fix dynamic lib call crash If an application treats these functions as function pointers, and this lib (isa-l) is compiled into a shared library, a segmentation fault may occur. For example, Ubuntu 16.04 on an arm64 platform will crash because the linker does not know that the symbol is a function. Marking the function type explicitly with %function solves this issue. Change-Id: Iba41b1f1367146d7dcce09203694b08b1cb8ec20 Signed-off-by: Zhiyuan Zhu <zhiyuan.zhu@arm.com>	2019-03-01 02:55:50 +00:00
Zhiyuan Zhu	f5aa9d72de	raid: Add license headers Change-Id: I0d2d48eb30c31ff6967c132a415431dddd8a8982 Signed-off-by: Zhiyuan Zhu <zhiyuan.zhu@arm.com>	2019-02-22 14:32:19 -07:00
Zach Bjornson	f9588bbedc	igzip: export isal_adler32 Change-Id: Iadb73851f826131cc59974b65240b501e9d57f98 Signed-off-by: Zach Bjornson <zbbjornson@gmail.com>	2019-02-10 13:37:52 -07:00
Yibo Cai	19fb012e81	raid: Add aarch64 NEON implementation Change-Id: I6ad471d3b22a87bfa7e216713e04afa990a90edb Signed-off-by: Yibo Cai <yibo.cai@arm.com>	2019-02-10 13:08:56 -07:00
John Kariuki	2393791654	build: Add multi-arch autoconf support Added multi-arch support to configure.ac. Updated header files to only export sse and avx functions on x86 Change-Id: I4d1f8d0eccabad55ee887dc092a565c468f5c629 Signed-off-by: John Kariuki <John.K.Kariuki@intel.com>	2018-12-10 13:40:55 -07:00
Roy Oursler	ba1a000680	igzip: Implement set_long_icf to compare more than 258 Change-Id: Ia8813d176da6bfcd3c6ef441eca1c59ac99db7f2 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-12-04 10:41:40 -07:00
Greg Tucker	eaa1c18a94	doc: Fix spelling errors in headers Change-Id: I0f4164b39b185fa808c66208df0731b5e031d7fd Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-12-04 10:12:14 -07:00
Greg Tucker	e19101f5de	doc: Add detail of internal checksum value in gzip/zlib mode Change-Id: I8f7fdcec40371e61eb19248cb24c9837d0845a0c Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-12-04 10:11:40 -07:00
Greg Tucker	2e212f28fa	build: Fix for mac nasm lack of symbol types Change-Id: I9ee86a3e32876d3860477c8365fc459d94a8920e Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-11-29 13:54:36 -07:00
Greg Tucker	06b926fbb6	igzip: Fix portability issue when bad window size passed If a user passes an invalid size for window bits it could have triggered an undefined shift by larger than variable size. Change-Id: Ib2999b094af075596be3333418667ae9b498e2ae Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-10-25 14:43:27 -07:00
John Kariuki	6e2013391a	mem: Add zero detect memory functions This patch introduces the base, avx and sse optimized zero detect memory functions. The zero detect memory function tests whether a memory region is all zeroes. If all the bytes in the memory region are zero, the function returns 0. Otherwise, if the memory region has nonzero bytes, the function returns 1. Change-Id: If965badf750377124d0067d09f888d0419554998 Signed-off-by: John Kariuki <John.K.Kariuki@intel.com>	2018-09-25 14:33:31 -07:00
Roy Oursler	8e4f1a1a38	igzip: Create generic deflate performance utility Change-Id: Idf180660797f97a492550fb557652f036cd55509 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-20 11:12:02 -07:00
Roy Oursler	ff1928e8ec	igzip: Create functions to write gzip/zlib headers Change-Id: If5aaa277a01214bd36406ee11680df0904ad12f7 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-20 11:12:02 -07:00
Roy Oursler	43374f6776	igzip: Implement gzip/zlib header/trailer parsing Change-Id: I3fe8653f2286212a9d6c6ecfa3b78752b2cac8ef Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-20 11:12:02 -07:00
Roy Oursler	11542000f0	igzip: Implement limited window size for inflate Change-Id: Ib7fce6a51db99fc7e11f06f5916c2b755bfc5c67 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-20 11:12:02 -07:00
Roy Oursler	03bef684a4	igzip: Setup for variable hash mask Change-Id: I3be94dbc40c2e02dcff4f89e5a9df8ed1f744f02 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-18 14:27:25 -07:00
Roy Oursler	6317ce2b78	igzip: Setup for variable lookback distance Change-Id: Idd52c9392113dfc54feea3c66916a7f5aa128bef Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-09-18 14:27:25 -07:00
Roy Oursler	bb3c6c28c8	igzip: Remove need for total_in_start Change-Id: Ie9ab3e702ce07a5ba8d6fb3275da98e03c25822b Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-08-21 13:18:05 -07:00
Greg Tucker	a1869430c7	build: Fix warnings on mac for objects defining no symbols Change-Id: I13ef334ae23a3370cbf2a5409974fa0dc9fba7a5 Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-06-21 14:39:08 -07:00
Greg Tucker	da1aee8714	igzip: Remove references to types.h Previously included just for struct alignment but all restrictions have been removed. Change-Id: I3fa7cbab86fce419b3b3bfccb48d9129bd77cf64 Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2018-06-21 14:39:08 -07:00
Roy Oursler	2de5a0fd88	igzip: Swap length code lookup with length lookup Change-Id: I9f1c3ea5353f2c2fa98bab1d0cb1eb3c7b7397f6 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-06-14 15:30:14 -07:00
Roy Oursler	222a68f760	igzip: Implement multibyte decode Change-Id: I923a57a01f696f2082945fafcc2773655b9a5beb Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-06-14 15:30:14 -07:00
Roy Oursler	7a1dc55c27	igzip: Increase size of large short code lookup Change-Id: I05a564d1759ae417a966f3a12621799db0edf80a Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2018-06-14 15:30:14 -07:00
Daniel Verkamp	99b45db17e	types.h: remove [U]INT{8,16,32,64} typedefs These can be replaced with the <stdint.h> types. Additionally, the existing definitions weren't correct on some platforms (e.g. IA-32, where 'long', used for INT64/UINT64, is only 32 bits). Change-Id: I1d9235c693ca2dc0c51d085128cecc4effc165fd Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-03-29 09:35:25 -07:00
Roy Oursler	48119c5c87	igzip: Increase long_code_lookup struct size to fix buffer overflow Change-Id: I6546dcb7ffcd5895292d06fdc748c3cf279a4542	2017-12-19 10:24:23 -07:00
Greg Tucker	491035d956	crc: Add t10dif+copy function Change-Id: Ic6c424a0aa746aa06643575f7fcc8d6944cbfc0e Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2017-12-18 15:59:17 -07:00
Roy Oursler	8965584ae3	igzip: Implement large hash table method as level 2 Move current level 2 compression to be level 3 and create a level 2 compression which is the same algorithm as level 1 but with a larger hashtable. Change-Id: I398a6017c2ff218dded24e229fb51c23ea69f314 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	7a12bcb2a8	igzip: Separate concept of level and compression method Change-Id: I82a5fbeb93adc77057893c643e044e311e4f393c Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	fe68f02dac	igzip: Move hash_table and symbol histogram for icf compression Change-Id: I50df9c8915ff3e1af450aeb8e4c0cc3baf9624ae Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	06cd70481c	igzip: Separate defines for LVL1 and LVL0 hash tables Change-Id: I19bdec8d2d0c74083bc1695763c9630516995885 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	4ae2d1be29	igzip: Implement optimized level 2 compression Change-Id: I8cf5bcd56f290d17205ac36dc2828c8acfc66947 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	49156c3568	igzip: Implement block buffer Change-Id: I9d7942740557e4ffaf8e223e190f4bd4e4f47b1e Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	6049ce3ca7	igzip: Modify state to record total_in on deflate call Change-Id: I13e5878a227732545aee5a762bf5a9a75ce73f02 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	59b9990a39	igzip: Remove file_start from zstate Change-Id: Ia4cb6dc86da54cc771f25a6d958bea730caa4801 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-12-15 14:27:14 -07:00
Roy Oursler	cad5e7d479	ec: Fix doxygen comment format for gen_rs_matrix Change-Id: I505155b0d57576814d876cd2d3595e522e03c469 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-10-17 10:18:45 -07:00
Roy Oursler	80bfbb33df	igzip: Remove DECLARE_ALIGNED and optimize structure layout Change-Id: I95bc3b8e2e30aff0d596c743158337400c4eb486 Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-08-18 14:35:44 -07:00
Xiaodong Liu	3ab8239097	multibinary: move WRT_OPT macro to common header Signed-off-by: Xiaodong Liu <xiaodong.liu@intel.com>	2017-08-18 14:24:57 -07:00
Roy Oursler	34c341db35	igzip: Add reset functions for both deflate and inflate. Change-Id: I8677a4365ac5c2343751660176f3b2eb4746ddfe Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-06-26 04:14:44 -04:00
Xiaodong Liu	7137c4a5be	crc: release crc32_gzip_refl code outside Merge crc32_gzip_refl function definitions, base code, multi-binary code into crc32.h, crc32_base.c and crc_multibinary.asm in order to keep consistency. Add crc32_gzip_refl files into crc/Makefile.am. The original crc32_gzip_refl removed the NOT operation; re-add it. Change-Id: Ib0cbbeb1ab3c9fcafec324b392596d2514202424 Signed-off-by: Xiaodong Liu <xiaodong.liu@intel.com>	2017-06-26 04:14:27 -04:00
Roy Oursler	ed15402f5b	igzip: Add stateful dictionary support Change-Id: I75dbac947787bc0041674468c88d0aa41b8b082f Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-06-26 04:11:48 -04:00
Roy Oursler	82a6ac65dc	ec: Determine exact conditions where gf_gen_rs_matrix works Add a program calculating some of the exact conditions where gf_gen_rs_matrix works, add comments stating these bounds to gf_gen_rs_matrix, and fix erasure code test that violates the bounds. Change-Id: I1d0010b09fea97731bfd24f4f76e24609538b24f Signed-off-by: Roy Oursler <roy.j.oursler@intel.com>	2017-06-26 04:11:12 -04:00
Greg Tucker	fc1467deb2	Format only patch from iindent and remove_whitespace Change-Id: I114bfcfa8750c7ba3a50ad2be9dd9e87cb7a1042 Signed-off-by: Greg Tucker <greg.b.tucker@intel.com>	2017-06-26 04:10:47 -04:00
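
Commit 183385f02f in the log above describes selecting an implementation from the hardware capabilities the kernel exports through `getauxval`, rather than from identification registers. The block below is a minimal sketch of that idea in C, not the dispatcher from the commit itself. The names `sketch_query_hwcap`, `sketch_select_crc`, and the `SKETCH_HWCAP_*` constants are hypothetical. The bit values are assumed to mirror the Linux arm64 `HWCAP_PMULL` and `HWCAP_CRC32` definitions and are defined locally so the block builds without `<asm/hwcap.h>`.

```c
#if defined(__linux__)
#include <sys/auxv.h> /* getauxval(), AT_HWCAP */
#endif

/* Assumed bit values, mirroring the Linux arm64 HWCAP_PMULL and HWCAP_CRC32. */
#define SKETCH_HWCAP_PMULL (1UL << 4)
#define SKETCH_HWCAP_CRC32 (1UL << 7)

/* Hypothetical implementation choices for a CRC routine. */
enum sketch_crc_impl {
	SKETCH_CRC_BASE = 0, /* portable C fallback */
	SKETCH_CRC_HW = 1    /* would use the optional CRC32 instructions */
};

/* Ask the kernel which optional features it advertises to userspace. */
unsigned long sketch_query_hwcap(void)
{
#if defined(__linux__) && defined(__aarch64__)
	return getauxval(AT_HWCAP);
#else
	return 0; /* not AArch64 Linux: report no optional features */
#endif
}

/* Pure selection step: pick an implementation from a capability mask. */
enum sketch_crc_impl sketch_select_crc(unsigned long hwcap)
{
	if (hwcap & SKETCH_HWCAP_CRC32)
		return SKETCH_CRC_HW;
	return SKETCH_CRC_BASE;
}
```

A caller would typically run `sketch_select_crc(sketch_query_hwcap())` once and cache the result. Keeping the selection step separate from the query makes it testable on machines without the feature.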

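Commit 57eed2f02b in the log above notes that comparing a `char` with `EOF` misbehaves where `char` is unsigned, as on Arm by default. The sketch below shows the usual remedy of holding the result of `getc` in an `int`. The function name `sketch_discard_line` is hypothetical and is not taken from `programs/igzip_cli.c`.

```c
#include <stdio.h>

/*
 * Discard input up to and including the next newline.
 * Returns the number of bytes consumed (0 at end of file).
 */
long sketch_discard_line(FILE *in)
{
	int c; /* int, not char: must represent every byte value plus EOF */
	long consumed = 0;

	while ((c = getc(in)) != EOF) {
		consumed++;
		if (c == '\n')
			break;
	}
	return consumed;
}
```

Because `c` is an `int`, the comparison with `EOF` behaves the same whether plain `char` is signed or unsigned on the target.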

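Commit 6e2013391a in the log above describes the zero detect contract: return 0 when every byte in the region is zero, and 1 otherwise. A byte-wise sketch of that contract follows. It stands in for the base version only as an illustration, and the name `sketch_zero_detect` is hypothetical. The optimized sse and avx variants mentioned in the commit are not shown.

```c
#include <stddef.h>

/* Return 0 if all len bytes at mem are zero, 1 if any byte is nonzero. */
int sketch_zero_detect(const void *mem, size_t len)
{
	const unsigned char *p = (const unsigned char *)mem;
	unsigned char acc = 0;
	size_t i;

	/* OR every byte together; acc stays 0 only if all bytes are 0. */
	for (i = 0; i < len; i++)
		acc |= p[i];

	return acc != 0;
}
```

An empty region (`len == 0`) is reported as all zero under this sketch, which is an assumption rather than documented library behavior.
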
92 Commits