Increased size of cs_isn.bytes to 33 to be able to hold big EVM instructions by f0rki · Pull Request #1231 · capstone-engine/capstone

f0rki · 2018-08-08T14:31:50Z

The EVM arch has a couple of really big instructions, with the biggest one being PUSH32 (with 33 bytes size). Currently cs_inst can only hold 16 bytes of raw instruction data, which is not enough for EVM. This results in the bytes being truncated to 16 bytes. This didn't really affect the disassembly or the op_str, but if you use cs_inst.size or cs_inst.bytes afterwards, it would return wrong data for those big instructions (PUSH15 up to PUSH32).

aquynh · 2018-08-09T15:36:09Z

33 looks weird though, should we round the size up to some other number?

f0rki · 2018-08-09T19:27:52Z

I don't know. It already feels like a lot of wasted memory for all the other archs.
Maybe it should also be surrounded by #ifdef HAS_EVM and fall back to the 16 byte size?

) * m68k: store correct m68k_reg value in op.reg_pair Originally, value - M68K_REG_D0 was stored and the print logic added M68K_REG_D0. * m68k: fix license typo

* normalize tab character in cs

* normalize tab character in cs * normalize in issue mode

…1415)

* Added RISCV dir to contain the RISCV architecture engine code. Adding the TableGen files generated from llvm-tblgen. Add Disassembler.h * Started working on RISCVDisassembler.c - RISCV_init(), RISCVDisassembler_getInstruction, and RISCV_getInstruction * Added all functions to RISCVDisassembler.c and needed modifications to RISCVGenDisassemblerTables.inc. Add and modified RISCVGenSubtargetInfo.inc. Start creation of RISCVInstPrinter.h * Finished RISCVGenAsmWriter.inc. Finished RISCVGenRegisterInfo.inc. Minor fixes to RISCVDisassembler.c. Working on RISCVInstPrinter * Finished RISCVInstPrinter, RISCVMapping, RISCVBaseInfo, RISCVGenInstrInfo.inc, RISCVModule.c. Working on riscv.h * Backport it from: porto703@0db412c * All RISCV files added. Compiled correctly and initial test for ADD, ADDI, AND works properly. * Add refactored cs.c for RISCV * Testing all I instructions in test_riscv.c * Modify the orignal backport for RISCVGenRegisterInfo.inc, capstone.h and test_iter to work w/ the current code strcuture * Fix issue with RISCVGenRegisterInfo.inc - RISCVRegDesc[] (Excess elements in struct initializer). Added RISCV tests to test_iter.c * fixed bug related to incorrect initialization of memory after malloc * fix compile bug * Fix compile errors. * move riscv.h to include/capstone * fix indentation issues * fix coding style issues * Fix indentation issues * fix coding style * Move variable declaration to the top of the block * Fix coding indentation * Move some stuff into RISCVMappingInsn.inc * Fix code sytle * remove cs_mode support for RISCV * update asmwriter-inc to LLVM upstream * update the .inc files to riscv upstream * update riscv disassembler function for suport 16bit instructions * update printer & tablegen inc files which have fixed arguments mismatch * update headers and mapping source * add riscv architecture specific test code * fix all RISCV tons of compiler errors * pass final tests * add riscv tablegen patchs * merge with upstream/next * fix cstool missing riscv file * fix root Makefile * add new TableGen patchs for riscv * fix cmakefile.txt of missing one riscv file * fix declaration conflict * fix incompatible declaration type * change riscvc from arch to mode * fix test_riscv warnning * fix code style and add riscv part of test_basic * add RISCV64 mode * add suite for riscv * crack fuzz test * fix getfeaturebits test add riscvc * fix test missing const qualifier warnning * fix testcase type mismatch * fix return value missing * change getfeaturebits test * add test cs files * using a winder type contain the decode string * fix a copy typo * remove useless mode for riscv * change cs file blank type * add repo for update_riscv & fix cstool missing riscv mode * fix typo * add riscv for cstool useage * add TableGen patch for riscv asmwriter * clean ctags file * remove black comment line * fix fuzz related something * fix missing RISCV string of fuzz * update readme, etc.. * add riscv *.s.cs file * add riscv *.s.cs file & clear ctags * clear useless array declarations at capstone_test * update to 5e4069f * update readme change name more formal * change position of riscv after bpf and modify copyright more uniform * clear useless ctags file * change blank with tab in riscv.h * add riscv python bindings * add riscv in __init__.py * fix riscv define value for python binding * fix test_riscv.py typo * add missing riscvc in __init__.py of python bindings * fix alias-insn printer bug, remove useless newline * change inst print delimter from tab to bankspace for travis * add riscv tablegen patch * fix inst output more consistency * add TableGen patch which fix inst output formal * crack the effective address output for detail and change register print function * fix not detail crash bug * change item declaration position at cs_riscv * update riscv.py * change function name more meaningfull * update python binding makefile * fix register enum sequence according to riscvgenreginfo.inc * test function name * add enum s0/fp in riscv.h & update riscv_const.py * add register name enum

capstone-engine#1421)

* fix bug in displacement offset * fix k0-k7 registers in X86 table.

…e-engine#1702) * mos65xx: use imm field for immediate operand value using the wrong field works on little-endian hosts, but on big-endian the wrong value would be read * mos65xx: set operand mem field to address also in relative modes previously the last operand would have an offset, which doesn't match the printed operand * mos65xx: add bpl instruction to test this demonstrates an address operand with relative addressing

…ne#1703)

https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=22236 Same as capstone-engine#1687 for next branch

) Co-authored-by: pancake <pancake@nopcode.org>

This was initially introduced in dce7da9 but lost in the LLVM 7 sync in 5a99624.

kabeor · 2021-11-10T12:58:18Z

I don't know. It already feels like a lot of wasted memory for all the other archs. Maybe it should also be surrounded by #ifdef HAS_EVM and fall back to the 16 byte size?

it's a good idea, can you do this continue?

…ieri/moffset_disp Fix the displacement offset for moffset-encoded operands

fixed library extension to build properly under CYGWIN

Correcting X86 Imm Size

tmfink · 2021-11-11T06:22:15Z

master branch is not really used, new development should go into next branch (unable to disassemble f3 48 0f 1e c8 (rdsspq rax) in Ubuntu 20.04 #1759)
Changing the size of a field in cs_insn is an ABI break, meaning a change like this should only go into next branch (and not backported to older branches)
- As a consequence, I don't think we want the memory layout of cs_insn to change as a result of something like HAS_EVM. Otherwise, the ABI used in a libcapstone.so would depend on whether the library was compiled with HAS_EVM. Any headers distributed headers and and all language bindings would need to know how the library was compiled.

kabeor · 2021-11-11T06:27:57Z

master branch is not really used, new development should go into next branch (unable to disassemble f3 48 0f 1e c8 (rdsspq rax) in Ubuntu 20.04 #1759)

Changing the size of a field in cs_insn is an ABI break, meaning a change like this should only go into next branch (and not backported to older branches)

yes, plz rebase your branch to 'next'. 'master' is not accept to PR

As a consequence, I don't think we want the memory layout of cs_insn to change as a result of something like HAS_EVM. Otherwise, the ABI used in a libcapstone.so would depend on whether the library was compiled with HAS_EVM. Any headers distributed headers and and all language bindings would need to know how the library was compiled.

@aquynh any thoughts?

… EVM instructions (PUSH15 until PUSH32)

f0rki · 2021-11-11T13:42:09Z

I rebased to next.

aquynh · 2021-11-11T15:01:00Z

i hesitate on this PR since it would break the API :-(

sylvainpelissier · 2022-01-31T17:17:59Z

cs_isn.bytes size is internal only no ? why it would break the API ?

f0rki · 2022-02-01T09:53:08Z

Alternatively: Would it make sense to move the opcode constant to the cs_evm struct?

tmfink · 2022-02-01T19:43:00Z

cs_isn.bytes size is internal only no ? why it would break the API ?

cs_insn is in capstone.h, which is a public header. Changing the bytes field changes the size of cs_insn which is an ABI break.

david942j · 2022-03-27T16:28:06Z

Will it make more sense to declare the field bytes to the very end of struct cs_insn as a dynamic-length array? Something like:

struct cs_insn {
  // other fields..
  uint32_t len_bytes; // length of "bytes"
  uint8_t bytes[];
};

Or alternatively declaring bytes as a pointer just like the detail field.

The length changing of bytes was the reason that Capstone has to bump from 4 to 5 (see #1315 (comment)), maybe it's better to consider a solution instead of bumping major versions because of "minor" fixes.

tmfink · 2022-03-28T09:53:18Z

The length changing of bytes was the reason that Capstone has to bump from 4 to 5 (see #1315 (comment)), maybe it's better to consider a solution instead of bumping major versions because of "minor" fixes.

Adding an extra pointer indirection/allocation is less efficient but could help maintain ABI compatibility. It depends on what we want to do.

Rot127 · 2024-03-20T09:34:48Z

Thank you for the PR! I closed it because it is out of date. With the new auto-sync update for v6 we made many changes to some main architectures and will do also to others.
This also changed the requirements we have now for new PRs.

If you still want to merge the changes, please rebase your fix onto the newest next branch and open a new PR.

aquynh and others added 28 commits March 2, 2019 14:59

bingdings: update X86 consts

d250f06

fuzz_disasm: declare cs_fuzz_arch()

ef940af

x86: add BND registers to regsize_map_32 & regsize_map_64

ced24fc

x86: remove PRINT_ALIAS_INSTR

b6b0af7

[M68K] store correct register value in op.reg_pair (capstone-engine#1411

8ce800c

) * m68k: store correct m68k_reg value in op.reg_pair Originally, value - M68K_REG_D0 was stored and the print logic added M68K_REG_D0. * m68k: fix license typo

normalize tab character in cs (capstone-engine#1413)

e416726

* normalize tab character in cs

normalize in issue mode (capstone-engine#1414)

b2e1c0b

* normalize tab character in cs * normalize in issue mode

x86: new files X86GenRegisterName.inc & X86GenRegisterName1.inc

55a65a7

x86: operand access for BND instructions

0e8d2f0

Avoids type confusion in cpu12 for M680X (capstone-engine#1417)

238b4b6

Fixes uninitialized memory for X86 BND instructions (capstone-engine#…

6a769e8

…1415)

x86: operand size of BNDxxx is 16

fd9dbbc

arm: cleanup ARMGenInstrInfo.inc

cc304c7

cstest: build with local libcapstone

63396f8

Merge branch 'next' of github.com:aquynh/capstone into next

3d6dd77

riscv: coding style cleanup

96fbf15

cleanup tests/

de1f713

put together all static architecture setups in cs.c

e77be5e

Corpus generation is more robust (capstone-engine#1419)

f6ccb88

Fix capstone-engine#1420: Capstone 4 fails to build when targeting UWP (

e0340ad

capstone-engine#1421)

Fix memory leak in RISC V (capstone-engine#1424)

7a947b3

cstool: add armv8 & thumbv8 to usage instruction

e4ea0c9

cstool: arm v8, thumb v8

925b74b

cstool: add armv8be & thumbv8be modes

5fc297f

arm: sync with llvm 7.0.1

124f91b

bindings: update ARM const after the last ARM update

ba2c6a2

arm: fix warnings reported by MSVC

c92ba6f

aeflores and others added 8 commits March 7, 2021 21:57

x86 Fix AVX-512 k registers (capstone-engine#1689)

4afdd97

* fix bug in displacement offset * fix k0-k7 registers in X86 table.

Always return the same type from regs_read (capstone-engine#1736)

1703efd

use ".byte" when skipdata is set up with NULL mnemonic (capstone-engi…

27ac4c0

…ne#1703)

ppc: fix registers overflow (capstone-engine#1688)

702dbe7

https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=22236 Same as capstone-engine#1687 for next branch

Use braces instead of indentation. C is not Python (capstone-engine#1745

f278de3

) Co-authored-by: pancake <pancake@nopcode.org>

Fix the displacement offset for moffset-encoded operands

cd66cb2

This was initially introduced in dce7da9 but lost in the LLVM 7 sync in 5a99624.

Adds oss-fuzz badge (capstone-engine#1541)

7ae0770

Smartsmurf and others added 4 commits November 10, 2021 17:05

switched to next branch

9dbc677

Merge pull request capstone-engine#1754 from jranieri-grammatech/jran…

c7538d4

…ieri/moffset_disp Fix the displacement offset for moffset-encoded operands

Merge pull request capstone-engine#1791 from Smartsmurf/next

dcaeafe

fixed library extension to build properly under CYGWIN

Merge pull request capstone-engine#1657 from NicolasDerumigny/next

7e886c7

Correcting X86 Imm Size

Increased size of cs_isn.bytes to 33 (from 16) to be able to hold big…

6f27bfa

… EVM instructions (PUSH15 until PUSH32)

f0rki force-pushed the evm_bytes_size branch from b8dfaaf to 6f27bfa Compare November 11, 2021 12:52

f0rki changed the base branch from master to next November 11, 2021 12:54

sylvainpelissier mentioned this pull request Feb 1, 2022

Improve EVM analysis radareorg/radare2#19650

Merged

4 tasks

f0rki mentioned this pull request Feb 1, 2022

Incomplete EVM Support #1838

Open

kabeor force-pushed the next branch from c182d0e to d78d0ca Compare November 20, 2023 02:51

Rot127 closed this Mar 20, 2024

Conversation

f0rki commented Aug 8, 2018

Uh oh!

aquynh commented Aug 9, 2018

Uh oh!

f0rki commented Aug 9, 2018

Uh oh!

kabeor commented Nov 10, 2021

Uh oh!

tmfink commented Nov 11, 2021

Uh oh!

kabeor commented Nov 11, 2021

Uh oh!

f0rki commented Nov 11, 2021

Uh oh!

aquynh commented Nov 11, 2021

Uh oh!

sylvainpelissier commented Jan 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

f0rki commented Feb 1, 2022

Uh oh!

tmfink commented Feb 1, 2022

Uh oh!

david942j commented Mar 27, 2022

Uh oh!

tmfink commented Mar 28, 2022

Uh oh!

Rot127 commented Mar 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

19 participants

sylvainpelissier commented Jan 31, 2022 •

edited

Loading