- c593e9e Fix typos by Mike Taves · 8 months ago
- 6331da9 fixing a lot of typos by Frédéric Chapoton · 8 months ago
- fb95e90 Add truncation op by Charles Schlosser · 11 months ago
- 888fca0 Simd sincos double by Damiano Franzò · 12 months ago
- be06c9a Implement float pexp_complex by Damiano Franzò · 1 year, 2 months ago
- 7fd7a3f Implement plog_complex by Damiano Franzò · 1 year, 4 months ago
- a73970a Fix arm32 issues. by Antonio Sánchez · 1 year, 3 months ago
- f38e16c Apply clang-format by Tobias Wood · 1 year, 4 months ago
- dd8c71e Fix typecasting for arm32 by Antonio Sánchez · 1 year, 5 months ago
- 66b9f4e Fix (u)int64_t->float conversion on arm by Pavel Labath · 1 year, 5 months ago
- 81b4806 Fix arm32 float division and related bugs by Charles Schlosser · 1 year, 7 months ago
- 6e4d5d4 Add IWYU private pragmas to internal headers. by Antonio Sánchez · 1 year, 8 months ago
- 7465b76 Disable FP16 arithmetic for arm32. by Antonio Sánchez · 1 year, 10 months ago
- 59b3ef5 Partially Vectorize Cast by Charles Schlosser · 1 year, 10 months ago
- df1049d Small packet math cleanup. by Rasmus Munk Larsen · 2 years ago
- 2c8011c Fix arm builds. by Antonio Sánchez · 2 years, 1 month ago
- 21c49e8 Delete mystery character from Eigen/src/Core/arch/NEON/MathFunctions.h by Rasmus Munk Larsen · 2 years, 1 month ago
- d6235d7 Clean up generic packetmath specializations for various backends with the help of a macro. by Rasmus Munk Larsen · 2 years, 1 month ago
- ce62177 Vectorize atanh & add a missing definition and unit test for atan. by Rasmus Munk Larsen · 2 years, 2 months ago
- 3842699 More NEON packetmath fixes. by Antonio Sánchez · 2 years, 2 months ago
- 2dfbf1b Fix NEON make_packet2f. by Antonio Sánchez · 2 years, 2 months ago
- 0a5392d Fix MSVC arm build. by Antonio Sánchez · 2 years, 2 months ago
- d70b486 issue #2581: review and cleanup of compiler version checks by Sean McBride · 2 years, 3 months ago
- c54785b Fix error: unused parameter 'tmp' [-Werror,-Wunused-parameter] on clang/32-bit arm by Martin Burchell · 2 years, 3 months ago
- 311cc0f Enable NEON pcmp, plset, and complex psqrt by Arthur · 2 years, 4 months ago
- bb6675c Fix incorrect NEON native fp16 multiplication. by Antonio Sánchez · 2 years, 4 months ago
- c4fb6af Enable NEON pabs for unsigned int types by Arthur Feeney · 2 years, 4 months ago
- d194167 Fix the bug using neon instruction fmla for data type half by Lianhuang Li · 2 years, 4 months ago
- 9b6d624 fix neon by Charles Schlosser · 2 years, 5 months ago
- 7e398e9 Add missing return keyword in psignbit for NEON. by Rasmus Munk Larsen · 2 years, 5 months ago
- 82b152d Add signbit function by Charles Schlosser · 2 years, 5 months ago
- c475228 Vectorize atan() for double. by Rasmus Munk Larsen · 2 years, 6 months ago
- ed8cda3 Move EIGEN_NEON_GEBP_NR macro to the right place in GeneralBlockPanelKernel.h by Rasmus Munk Larsen · 2 years, 7 months ago
- e2ea866 Add a macro to set the nr trait in the GEBP kernel for NEON. by Rasmus Munk Larsen · 2 years, 7 months ago
- 2329963 Use 3px8/2px8/1px8/1x8 gebp_kernel on arm64-neon by Lianhuang Li · 2 years, 7 months ago
- bd393e1 Vectorize acos, asin, and atan for float. by Rasmus Munk Larsen · 2 years, 7 months ago
- 9b9496a Revert "Add AVX512 optimizations for matrix multiply" by Antonio Sánchez · 2 years, 11 months ago
- 25db0b4 Add AVX512 optimizations for matrix multiply by aaraujom · 2 years, 11 months ago
- f3ba220 Remove EIGEN_EMPTY_STRUCT_CTOR by Tobias Schlüter · 3 years ago
- f1b9692 Removed EIGEN_UNUSED decorations from many functions that are in fact used by Sean McBride · 3 years, 1 month ago
- 28e008b Fix sqrt/rsqrt for NEON. by Antonio Sánchez · 3 years, 2 months ago
- 6b60bd6 Fix 32-bit arm int issue. by Antonio Sánchez · 3 years, 2 months ago
- 8d81a23 Reduce usage of reserved names by Kolja Brix · 3 years, 3 months ago
- c4b1dd2 Add support for Cray, Fujitsu, and Intel ICX compilers by Matthias Möller · 3 years, 3 months ago
- cc11e24 Some further cleanup by Erik Schultheis · 3 years, 4 months ago
- 6bb6a6b Vectorize fp16 tanh and logistic functions on Neon by Alex Druinsky · 3 years, 5 months ago
- afa616b Fix some typos found by Kolja Brix · 3 years, 7 months ago
- d7d0bf8 Issue an error in case of direct inclusion of internal headers. by Rasmus Munk Larsen · 3 years, 7 months ago
- ff07a8a GCC 4.8 arm EIGEN_OPTIMIZATION_BARRIER fix (#2315). by Antonio Sanchez · 3 years, 7 months ago
- ab28419 optimize predux if architecture is aarch64 by Han-Kuan Chen · 3 years, 8 months ago
- 66ca41b Add support for vectorizing logical comparisons. by derekjchow · 3 years, 9 months ago
- 9312a5b Implement a generic vectorized version of Smith's algorithms for complex division. by Rasmus Munk Larsen · 3 years, 9 months ago
- c81da59 Correct declarations for aarch64-pc-windows-msvc by 大河メタル · 3 years, 9 months ago
- bffd267 Small cleanup: Get rid of the macros EIGEN_HAS_SINGLE_INSTRUCTION_CJMADD and CJMADD, which were effectively unused, apart from on x86, where the change results in identically performing code. by Rasmus Munk Larsen · 3 years, 10 months ago
- 52a5f98 Get rid of code duplication for conj_helper. For packets where LhsType=RhsType a single generic implementation suffices. For scalars, the generic implementation of pconj automatically forwards to numext::conj, so much of the existing specialization can be avoided. For mixed types we still need specializations. by Rasmus Munk Larsen · 3 years, 10 months ago
- fc87e2c Use bit_cast to create -0.0 for floating point types to avoid compiler optimization changing sign with -ffast-math enabled. by Rasmus Munk Larsen · 3 years, 10 months ago
- dba753a Add missing NEON ptranspose implementations. by Antonio Sanchez · 3 years, 11 months ago
- 722ca0b Revert addition of unused `paddsub<Packet2cf>`. This fixes #2242 by Christoph Hertzberg · 4 years ago
- 172db7b Add missing pcmp_lt_or_nan for NEON Packet4bf. by Antonio Sanchez · 4 years ago
- 4811e81 Remove yet another comma at end of enum by David Tellenbach · 4 years, 1 month ago
- 82d61af Fix rint SSE/NEON again, using optimization barrier. by Antonio Sanchez · 4 years, 1 month ago
- 9a66397 Revert "Fix rint for SSE/NEON." by Antonio Sánchez · 4 years, 1 month ago
- e72dfeb Fix rint for SSE/NEON. by Antonio Sanchez · 4 years, 1 month ago
- 1e0c7d4 Add print for SSE/NEON, use NEON rounding intrinsics if available. by Antonio Sanchez · 4 years, 1 month ago
- 29ebd84 Fix NEON sqrt for 32-bit, add prsqrt. by Antonio Sanchez · 4 years, 1 month ago
- e19829c Fix floor/ceil for NEON fp16. by Antonio Sanchez · 4 years, 1 month ago
- 5529db7 Fix SSE/NEON pfloor/pceil for saturated values. by Antonio Sanchez · 4 years, 1 month ago
- 6cf0ab5 Disable fast psqrt for NEON. by Antonio Sanchez · 4 years, 1 month ago
- 7ff0b7a Updated pfrexp implementation. by Antonio Sanchez · 4 years, 2 months ago
- f702792 missing method in packetmath.h void ptranspose(PacketBlock<Packet16uc, 4>& kernel) by Ashutosh Sharma · 4 years, 2 months ago
- 90ee821 Use vrsqrts for rsqrt Newton iterations. by Antonio Sanchez · 4 years, 2 months ago
- 7eb07da loop less ptranspose by Ashutosh Sharma · 4 years, 2 months ago
- f85038b Fix excessive GEBP register spilling for 32-bit NEON. by Antonio Sanchez · 4 years, 2 months ago
- b2126fd Fix pfrexp/pldexp for half. by Antonio Sanchez · 4 years, 3 months ago
- 38ae535 Provide a better generic paddsub op implementation by Guoqiang QI · 4 years, 3 months ago
- 0575410 Add iterative psqrt<double> for AVX and SSE when FMA is available. This provides a ~10% speedup. by Rasmus Munk Larsen · 4 years, 4 months ago
- 6cee8d3 Add an additional step of Newton-Raphson for `psqrt<double>` on Arm, which otherwise has an error of ~1000 ulps. by Rasmus Munk Larsen · 4 years, 4 months ago
- 55967f8 Fix NEON pmax<PropagateNumbers,Packet4bf>. by Antonio Sanchez · 4 years, 4 months ago
- c7eb3a7 Don't guard psqrt for std::complex<float> with EIGEN_ARCH_ARM64 by David Tellenbach · 4 years, 4 months ago
- bccf055 Add Armv8 guard on PropagateNumbers implementation. by Everton Constantino · 4 years, 4 months ago
- 00be0a7 Fix vectorization of complex sqrt on NEON by David Tellenbach · 4 years, 4 months ago
- 8eb461a Remove comma at end of enumerator list in NEON PacketMath by David Tellenbach · 4 years, 4 months ago
- baf9d76 Enabling PropagateNaN and PropagateNumbers for NEON. by Everton Constantino · 4 years, 5 months ago
- e2f2146 Special function implementations for half/bfloat16 packets. by Antonio Sanchez · 4 years, 4 months ago
- 70fbcf8 Fix typo in `F32MaskToBf16Mask`. by Antonio Sanchez · 4 years, 4 months ago
- 2627e2f Fix neon cmp* functions for bf16. by Antonio Sanchez · 4 years, 4 months ago
- fd1dcb6 Fixes duplicate symbol when building blas by Antonio Sanchez · 4 years, 5 months ago
- 11e4056 Re-enable Arm Neon Eigen::half packets of size 8 by David Tellenbach · 4 years, 5 months ago
- e9b55c4 Avoid promotion of Arm __fp16 to float in Neon PacketMath by David Tellenbach · 4 years, 5 months ago
- 394f564 Unify Inverse_SSE.h and Inverse_NEON.h into a single generic implementation using PacketMath. by Guoqiang QI · 4 years, 5 months ago
- 8324e5e Fix typo in NEON/PacketMath.h by guoqiangqi · 4 years, 5 months ago
- e265f7e Add support for Armv8.2-a __fp16 by David Tellenbach · 4 years, 5 months ago
- 4e4d3f3 Clean up packetmath tests and fix various bugs to make bfloat16 pass (almost) all packetmath tests with SSE, AVX, and AVX512. by Rasmus Munk Larsen · 4 years, 6 months ago
- 9aad16b Fix undefined reference to pset1frombits bug on different platforms by guoqiangqi · 4 years, 7 months ago
- 14022f5 Fix more mildly embarrassing typos in ARM intrinsics in PacketMath.h. by Rasmus Munk Larsen · 4 years, 7 months ago
- a5b2269 Fix typo in PacketMath.h by Rasmus Munk Larsen · 4 years, 7 months ago
- 3af744b Add missing packet op pcmp_lt_or_nan for Packet2d on ARM. by Rasmus Munk Larsen · 4 years, 7 months ago
- 880fa43 Add support for CastXML on ARM aarch64 by Brad King · 4 years, 7 months ago
- cc0c38a Remove old Clang compiler bug work-arounds. The two LLVM bugs referenced in the comments here have long been fixed. The workarounds were now detrimental because (1) they prevented using fused mul-add on Clang/ARM32 and (2) the unnecessary 'volatile' in 'asm volatile' prevented legitimate reordering by the compiler. by Benoit Jacob · 4 years, 7 months ago
- 3012e75 Add plog ops support packet2d for NEON by Guoqiang QI · 4 years, 7 months ago