* Refactored the whole codebase to use the snake_case convention for
naming functions, function parameters, variables, and struct/class
members. Class naming is unchanged and each class name starts with an
upper-case character. The intention of this change is to make the source
code more readable and consistent across multiple projects where AsmJit
is currently used.
* Refactored support.h to make it more shareable across projects.
* x86::Vec now inherits from UniVec
* Minor changes in JitAllocator and WriteScope in order to make
the size of WriteScope smaller
* Added ZoneStatistics and Zone::statistics() getter
* Improved x86::EmitHelper to use tables instead of choose() and
other mechanisms to pick between SSE and AVX instructions
* Added a non-owning asmjit::Span<T> type and used it in the public API
to hide the usage of ZoneVector in CodeHolder, Builder, and Compiler.
Users now only get a Span (with data and size), so they don't need to
know about ZoneVector (see the sketch below)
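A minimal sketch of what such a non-owning view looks like (illustrative
only - this is not AsmJit's actual asmjit::Span<T> definition and the
member names are assumptions):

    #include <cstddef>

    // A non-owning span is just a pointer + size pair, so the public API can
    // hand out views of internal storage without exposing ZoneVector itself.
    template<typename T>
    class Span {
    public:
      Span() noexcept = default;
      Span(T* data, size_t size) noexcept : _data(data), _size(size) {}

      T* data() const noexcept { return _data; }
      size_t size() const noexcept { return _size; }
      bool is_empty() const noexcept { return _size == 0; }

      T& operator[](size_t i) const noexcept { return _data[i]; }

      // Range-for support so entries returned by the API can be iterated.
      T* begin() const noexcept { return _data; }
      T* end() const noexcept { return _data + _size; }

    private:
      T* _data {};
      size_t _size {};
    };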
* Removed RAWorkId from RATiedReg in favor of RAWorkReg*
* Removed GEN from LiveInfo to save memory, as it's not needed by CFG
construction (GEN was merged with the LIVE-IN bits). The remaining
LIVE-IN, LIVE-OUT, and KILL bits are enough; KILL bits may be removed
in the future as they are not needed after LIVE-IN and LIVE-OUT have
converged (see the sketch below)
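For context, this refers to the classic backward dataflow where
LIVE-IN = GEN | (LIVE-OUT & ~KILL); seeding LIVE-IN with the GEN bits
before iterating makes a separate GEN set unnecessary. A rough sketch
under that assumption (not AsmJit's actual code; one 64-bit word per set):

    #include <cstdint>

    struct BlockLiveness {
      uint64_t live_in;   // pre-seeded with the block's GEN bits
      uint64_t live_out;  // union of the successors' live_in
      uint64_t kill;      // registers defined before any use in the block
    };

    // One fix-point step for a single block; returns true if anything changed.
    inline bool update(BlockLiveness& b, uint64_t successors_live_in) noexcept {
      uint64_t out = successors_live_in;
      uint64_t in  = b.live_in | (out & ~b.kill);  // GEN is already folded in
      bool changed = (out != b.live_out) || (in != b.live_in);
      b.live_out = out;
      b.live_in  = in;
      return changed;
    }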
* Optimized the representation of LIVE-IN, LIVE-OUT, and KILL bits
per block. Now only registers that live across multiple basic blocks
are included, which means that virtual registers that only live in a
single block are not included and add no overhead during liveness
analysis. This optimization alone can make liveness analysis 90%
faster depending on the generated code (more virtual registers that
only live in a single basic block -> more gains)
* Optimized building liveness information bits per block. The new code
uses an algorithm that avoids unnecessary traversals and adds a fast
path for functions that don't use many registers (it avoids bit-array
operations when all virtual registers within the function fit into a
single BitWord); see the sketch below
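A sketch of that fast path, assuming all virtual register indexes fit into
one BitWord so every per-block set is a plain integer (names and block
layout are illustrative, not AsmJit's):

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    using BitWord = uintptr_t;

    struct FastBlock {
      BitWord live_in;                  // pre-seeded with GEN bits
      BitWord live_out;
      BitWord kill;
      std::vector<uint32_t> successors; // indexes into the block array
    };

    // Iterates until LIVE-IN/LIVE-OUT converge - no bit-array loops are
    // needed when the whole register set fits a single BitWord.
    void build_liveness_single_word(std::vector<FastBlock>& blocks) {
      bool changed = true;
      while (changed) {
        changed = false;
        for (size_t i = blocks.size(); i > 0; i--) {  // roughly reverse order
          FastBlock& b = blocks[i - 1];
          BitWord out = 0;
          for (uint32_t succ : b.successors)
            out |= blocks[succ].live_in;
          BitWord in = b.live_in | (out & ~b.kill);
          if (out != b.live_out || in != b.live_in) {
            b.live_out = out;
            b.live_in = in;
            changed = true;
          }
        }
      }
    }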
* Optimized the code that computes which virtual registers are only used
in a single basic block - this aims to speed up the register allocator
in the future by using a dedicated code path for allocating registers
only used in a single basic block
* Reduced the information required for each live-span, which is used
by bin-packing. The struct is now 8 bytes, which enables many
optimizations the C++ compiler can do (see the hypothetical layout below)
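For illustration, an 8-byte span can be just two 32-bit positions; this is
a hypothetical layout, not AsmJit's actual struct:

    #include <cstdint>

    // A struct this small is trivially copyable, fits in a single register,
    // and packs densely in arrays, which is what enables the compiler
    // optimizations mentioned above.
    struct LiveSpan {
      uint32_t begin_position;
      uint32_t end_position;
    };

    static_assert(sizeof(LiveSpan) == 8, "LiveSpan must stay 8 bytes");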
* Added UniCompiler (ujit) which can be used to share code paths
between X86, X86_64, and AArch64 code generation (experimental).
* Reworked register operands - all vector registers are now
platform::Vec deriving from UniVec (universal vector operand).
Additionally, there is no platform::Reg; instead, asmjit::Reg provides
all necessary features to act as a base register for each target
architecture
* Reworked casting between registers - architecture-agnostic names are
now preferred - use Gp32 instead of Gpd or GpW, Gp64 instead of Gpq
and GpX, etc...
* Reworked vector registers and their names - architecture-agnostic
naming is now preferred: Vec32, Vec64, Vec128, etc...
* Reworked naming conventions used across AsmJit - for clarity,
identifiers are now prefixed with the type, like sectionId(),
labelId(), etc...
* Reworked how Zone and ZoneAllocator are used across AsmJit,
preferring Zone in most cases and ZoneAllocator only for
containers - this change alone achieves around 5% better
performance of Builder and Compiler
* Reworked LabelEntry - decreased the size of the base entry
to 16 bytes for anonymous and unnamed labels. Avoided an
indirection when using labelEntries() - LabelEntry is now
a value and not a pointer
* Renamed LabelLink to Fixup
* Added a new header <asmjit/host.h>, which includes
<asmjit/core.h> plus target tools for the host architecture,
if enabled and supported (see the usage sketch below)
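A minimal usage sketch, assuming an x86-64 host with JIT support enabled
(these are the usual AsmJit calls; exact identifier spelling may differ
after the snake_case refactoring):

    #include <asmjit/host.h>   // <asmjit/core.h> + the host architecture backend

    using namespace asmjit;

    typedef int (*Func)(void);

    int main() {
      JitRuntime rt;                 // runtime that owns executable memory
      CodeHolder code;
      code.init(rt.environment());   // match the host environment

      x86::Assembler a(&code);       // on an x86-64 host the host backend is x86
      a.mov(x86::eax, 42);
      a.ret();

      Func fn;
      if (rt.add(&fn, &code) != kErrorOk)
        return 1;

      int result = fn();
      rt.release(fn);
      return result == 42 ? 0 : 1;
    }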
* Added new AArch64 instructions (BTI, CSSC, CHKFEAT)
* Added a mvn_ alternative to the mvn instruction (fix for the Windows
ARM64 SDK)
* Added more AArch64 CPU features to CpuInfo
* Added better support for Apple CPU detection (Apple M3, M4)
* Added a new benchmarking tool asmjit_bench_overhead, which
benchmarks the overhead of CodeHolder::init()/reset() and of
creating/attaching emitters to it. Thanks to this benchmark, the
most common code paths were optimized
* Added a new benchmarking tool asmjit_bench_regalloc, which
aims to benchmark the cost and complexity of register allocation.
* Renamed asmjit_test_perf to asmjit_bench_codegen to make it
clear what is a test and what is a benchmark
The problem is that the rewriter must also rewrite the instruction
ID in case it's a [K|V]MOV[B|W|D|Q] instruction that moves
from either a K or a SIMD register to a GP register. When such an
instruction is rewritten in a way that it ends up as "xMOVx GP, [MEM]",
it would be invalid unless it's changed to a general purpose MOV.
The problem can only happen when the compiler spills a virtual
register that is then moved to a scalar register.
In addition, checks were added to MOVD|MOVQ to ensure that emitting an
invalid instruction is no longer silently ignored, as it used to be.
The problem was that the sign/zero extension was not working with
TypeIds, only with register IDs, when emitted. Thus, even when the
TypeId was uint8, for example, the register could be uint32, so the
final emitted instruction was, for example, 'movzx eax, eax', which
was wrong.
* Renamed all eq() methods to equals() (consistency) (ABI)
* Reorganized some X86 instructions in X86 database
* Properly detect RISC-V CPU at compile time (Environment)
* Removed CallConvId::kNone in favor of defaulting to kCDecl (ABI)
* CallConvId::kHost is now alias to CallConvId::kCDecl (ABI)
* Added FloatABI to Environment to distinguish between softfloat
and hardfloat
* Added more AArch64 CPU features and their detection (ABI)
* Because of CallConvId changes it's now possible to run
compiler tests on 32-bit ARM (fixes a bug in test cases)
* Added QEMU to CI build matrix to test different architectures
* Added more clang compilers on CI (CI)
* Added memory sanitizer to build matrix (CI)
* Use problem matcher in all builds (CI)
* Fixed the use of some constructs in tests
* Fixed warnings about unused functions in tests
* Fixed warnings about unused variables caused by some build options
* Fixed tests to be clean with MSAN (zeroing memory filled by JIT code)
* Removed -Wclass-memaccess (gcc) from ignored warnings
* Removed -Wconstant-logical-operand (clang) from ignored warnings
* Removed -Wunnamed-type-template-args (clang) from ignored warnings
* Reworked InstNode and InstExNode to not cause UB (ABI break)
Unfortunately, the existing InstNode and InstExNode were not good for
static analysis, and in general compilers emitted warnings regarding
accessing InstNode::_opArray. The reason was that InstExNode added one
or two more operands which extended InstNode::_opArray, but there was
no way to tell the C++ compiler about this layout.
It has been changed so that InstNode has no operands and InstNodeWithOperands
is templatized for the right number of operands. Nodes that need to
inherit InstNode now just inherit InstNodeWithOperands<N>. It works the
same way as before, just the class hierarchy changed a little (a reduced
sketch follows below).
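A reduced sketch of the pattern (simplified, not AsmJit's actual classes):
the base node stores no operands and a templatized subclass provides
storage for exactly N operands, so the compiler sees the real layout and
no base-class array is accessed out of bounds:

    #include <cstdint>

    struct Operand {
      uint64_t bits[2] {};
    };

    class BaseInstNode {
    public:
      explicit BaseInstNode(uint32_t op_count) noexcept : _op_count(op_count) {}
      uint32_t op_count() const noexcept { return _op_count; }

    protected:
      uint32_t _op_count;
    };

    template<uint32_t N>
    class InstNodeWithOps : public BaseInstNode {
    public:
      InstNodeWithOps() noexcept : BaseInstNode(N) {}
      Operand& op(uint32_t i) noexcept { return _ops[i]; }

    private:
      Operand _ops[N];
    };

    // A node that needs up to 3 operands just picks the right capacity:
    class MyNode : public InstNodeWithOps<3> {};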
* Refactored workflows to use a single workflow for both VM and non-VM builds
* Compiler tests are now able to test compilation of different architectures
[ABI] Added ABI version as an inline namespace, which forms asmjit::_abi_MAJOR_MINOR
[ABI] Added support for AVX512_FP16, 16-bit broadcast, and AVX512_FP16 tests
[ABI] Added initial support for consecutive registers into instruction database and register allocator
[ABI] Added a possibility to use temporary memory in CodeHolder's zone
[ABI] Compiler::setArg() is now deprecated, use FuncNode::setArg()
[Bug] Fixed correct RW information of instructions that only support implicit zeroing with {k}
[Bug] Fixed broadcast to be able to broadcast bcst16 operands