Bytecode parity by youknowone · Pull Request #7481 · RustPython/RustPython

youknowone · 2026-03-23T04:24:26Z

Compiler changes:
- Remove PUSH_NULL from decorator cal ls, use CALL 0
- Collect static_attributes from self.xxx = patterns
- Sort static_attributes alphabetically
- Move classdict init before doc in class prologue
- Fold unary negative constants
- Fold constant list/set literals (3+ elements)
- Use BUILD_MAP 0 + MAP_ADD for 16+ dict pairs
- Always run peephole optimizer for s uperinstructions
- Emit RETURN_GENERATOR for generator functions
- Add is_generator flag to SymbolTabl e

Summary by CodeRabbit

Bug Fixes
- Fixed spurious trace events in generator and coroutine functions.
Performance Improvements
- Optimized compilation of decorators, class attributes, constants, and dictionary literals to reduce memory usage and execution overhead.

coderabbitai · 2026-03-23T04:24:33Z

📝 Walkthrough

Walkthrough

Multiple compiler and runtime components were enhanced: async/generator resume preamble handling, decorator application, class static attribute collection, dict literal optimization, constant-folding passes (unary negative, lists, sets), generator detection in symbol tables, JIT instruction support for StoreFastStoreFast, and generator frame initialization adjustments to prevent spurious trace events.

Changes

Cohort / File(s)	Summary
Compiler codegen – Resume and decorators `crates/codegen/src/compile.rs`	Updated `ReturnGenerator` + `PopTop` emission to apply to both async and generator scopes (via `is_generator` check). Reworked decorator compilation: removed `PushNull` after each decorator, changed `apply_decorators` to emit `Call { argc: 0 }` per iteration instead of `argc: 1`.
Compiler codegen – Class static attributes `crates/codegen/src/compile.rs`	Introduced `collect_static_attributes` and `scan_store_attrs` helpers to scan class bodies for self-attribute assignments in control-flow blocks. Integrated collection into `compile_class_body` and emitted deterministic `__static_attributes__` tuple. Adjusted `__classdict__` initialization ordering above `__doc__` storage.
Compiler codegen – Dict optimization `crates/codegen/src/compile.rs`	Optimized large dict literal generation: switched from `BuildMap { count: len }` to incremental `BuildMap { count: 0 }` + repeated `MapAdd` to reduce stack usage.
IR constant folding `crates/codegen/src/ir.rs`	Added three new constant-folding passes: `fold_unary_negative()` (negates constants), `fold_list_constants()` (folds constant lists into tuple + `LIST_EXTEND`), `fold_set_constants()` (folds constant sets into tuple + `SET_UPDATE`). Reordered finalization pipeline and ensured `peephole_optimize()` always runs.
Symbol table – Generator detection `crates/codegen/src/symboltable.rs`	Added `pub is_generator: bool` field to `SymbolTable`, initialized to `false`. Updated `SymbolTableBuilder` to mark scopes as generators when `yield` or `yield from` expressions are encountered.
JIT instruction support `crates/jit/src/instructions.rs`	Added handling for `StoreFastStoreFast { var_nums }` bytecode: decodes two variable indices, pops two stack values, and stores each to corresponding local variables.
VM frame – Generator initialization and StoreFastStoreFast `crates/vm/src/frame.rs`	Generator/coroutine frames now initialize `prev_line` to code object's `first_line_number` (not `0`) to prevent spurious `LINE` trace events from preamble bytecode. Updated `StoreFastStoreFast` to use `pop_value_opt()` for both values, allowing `NULL` values from `LoadFastAndClear` to propagate correctly.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Reduce per-instruction overhead in interpreter loop #7315 – Both PRs modify generator/coroutine frame initialization and trace event handling in prev_line logic.
Newtype var_nums oparg #7431 – Both PRs add JIT and VM handling for the StoreFastStoreFast instruction and decode var_nums indices.
cold block reordering and jump normalization #7210 – Both PRs modify the constant-folding and finalization pipeline ordering in crates/codegen/src/ir.rs.

Suggested reviewers

ShaharNaveh

🐰 Generators now know they're generators, decorators dance with cleaner stacks,
Classes collect their attributes with care, constants fold in clever ways,
And dicts grow gracefully—no excess burden, just elegant compilation! ✨

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name	Status	Explanation	Resolution
Title check	❓ Inconclusive	The title 'Bytecode parity' is vague and generic, using a non-descriptive term that doesn't clearly convey what specific changes were made, despite the PR implementing multiple significant compiler optimizations and bytecode changes.	Consider using a more specific title that highlights the main changes, such as 'Compiler optimizations: decorator calls, static attributes, and constant folding' or 'Improve bytecode generation with decorator call optimization and constant folding'.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-03-23T05:53:19Z

📦 Library Dependencies

The following Lib/ modules were modified. Here are their dependencies:

(module 'compile test_peepholer test_sys_settrace' not found)

Legend:

[+] path exists in CPython
[x] up-to-date, [ ] outdated

Compiler changes: - Remove PUSH_NULL from decorator cal ls, use CALL 0 - Collect __static_attributes__ from self.xxx = patterns - Sort __static_attributes__ alphabetically - Move __classdict__ init before __doc__ in class prologue - Fold unary negative constants - Fold constant list/set literals (3+ elements) - Use BUILD_MAP 0 + MAP_ADD for 16+ dict pairs - Always run peephole optimizer for s uperinstructions - Emit RETURN_GENERATOR for generator functions - Add is_generator flag to SymbolTabl e

- Replace irrefutable if-let with let for ExceptHandler - Remove folded UNARY_NEGATIVE instead of replacing with NOP, enabling chained negation folding - Initialize prev_line to def line for generators/coroutines to suppress spurious LINE events from preamble instructions - Remove expectedFailure markers for now-passing tests

- Add StoreFastStoreFast handling in JIT instructions - Fix cargo fmt in frame.rs - Remove 11 expectedFailure markers for async jump tests in test_sys_settrace that now pass

Using remove() shifts instruction indices and corrupts subsequent references, causing "pop stackref but null found" panics at runtime. Replace folded/combined instructions with NOP instead, which are cleaned up by the existing remove_nops pass.

NOP replacement broke chaining of peephole optimizations (e.g. LOAD_CONST+TO_BOOL then LOAD_CONST+UNARY_NOT for 'not True'). The remove() approach is used by upstream and works correctly here; fold_unary_negative keeps NOP replacement since it doesn't need chaining.

StoreFast uses pop_value_opt() to allow NULL values from LoadFastAndClear in inlined comprehension cleanup paths. StoreFastStoreFast must do the same, otherwise the peephole optimizer's fusion of two StoreFast instructions panics when restoring unbound locals after an inlined comprehension.

coderabbitai

Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

crates/codegen/src/symboltable.rs (1)

1830-1840: ⚠️ Potential issue | 🟠 Major

<genexpr> never sets the new generator flag.

The new flag is only written in the Yield/YieldFrom arms. Generator expressions take the scan_comprehension(..., true) path instead, and that is_generator parameter is never copied onto the comprehension scope, so any downstream consumer of SymbolTable::is_generator will still see <genexpr> as a plain comprehension.

Possible fix

 fn scan_comprehension(
     &mut self,
     scope_name: &str,
     elt1: &ast::Expr,
     elt2: Option<&ast::Expr>,
     generators: &[ast::Comprehension],
     range: TextRange,
     is_generator: bool,
 ) -> SymbolTableResult {
     // Comprehensions are compiled as functions, so create a scope for them:
     self.enter_scope(
         scope_name,
         CompilerScope::Comprehension,
         self.line_index_start(range),
     );
+    self.tables.last_mut().unwrap().is_generator = is_generator;

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/codegen/src/symboltable.rs` around lines 1830 - 1840, Generator
expressions created via scan_comprehension(..., true) never set the
SymbolTable::is_generator flag; update the code that creates/completes a
comprehension (the scan_comprehension path for genexpr) so that when its
"is_generator" parameter is true it writes to the current scope's flag (e.g.,
call tables.last_mut().unwrap().is_generator = true inside scan_comprehension or
copy the boolean into the new comprehension scope), mirroring how the
Expr::Yield/Expr::YieldFrom arms set is_generator in scan_expression.

🧹 Nitpick comments (1)

crates/codegen/src/compile.rs (1)
2758-2769: Update the stale decorator stack comment.

Line 2759 still documents prepare_decorators as leaving NULL sentinels on the stack, but this path now relies on a plain [dec1, dec2, func] layout plus CALL 0. The code looks fine; the comment is now the misleading part.

As per coding guidelines, "Do not delete or rewrite existing comments unless they are factually wrong or directly contradict the new code".
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@crates/codegen/src/compile.rs` around lines 2758 - 2769, The comment for
prepare_decorators is stale: it mentions leaving NULL sentinels on the stack
(e.g. [dec1, NULL, dec2, NULL]) but the current implementation of
prepare_decorators and apply_decorators uses a plain [dec1, dec2, func] layout
with CALL 0; update the doc comment above prepare_decorators (and the related
example in apply_decorators) to describe the actual stack shape and
transformation (e.g. push decorators in source order to produce [dec1, dec2,
func] and then CALL 0 repeatedly to produce dec1(dec2(func))). Ensure you
reference the functions prepare_decorators and apply_decorators so the updated
comment is next to the correct code.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/codegen/src/compile.rs`:
- Around line 1203-1206: The async-generator check is using CodeFlags::GENERATOR
in compile_statement which runs before mark_generator(), so async defs that are
generators slip through; when you compute is_gen (using scope_type ==
CompilerScope::AsyncFunction || self.current_symbol_table().is_generator)
propagate that generator classification into the code flags immediately (or
alternatively change the later check in compile_statement to consult
self.current_symbol_table().is_generator instead of CodeFlags::GENERATOR).
Concretely, set the generator bit on the current CodeFlags/state when is_gen is
true (or update the Line 2498 check to use current_symbol_table().is_generator)
so the async return-value guard sees the generator status before
mark_generator() is called.
- Around line 4512-4597: The collector misses methods with a pos-only first
parameter and doesn't descend into match statements; update
collect_static_attributes to pick the first param by checking
params.posonlyargs.first() before params.args.first() (or otherwise consider
both lists) when computing self_name in collect_static_attributes, and extend
scan_store_attrs to handle the match statement AST node (iterate each match
case/body and scan them) so assignments inside match arms are discovered; refer
to the functions collect_static_attributes and scan_store_attrs to locate where
to change the parameter selection and add the new match-arm recursion.

In `@crates/codegen/src/ir.rs`:
- Around line 192-196: The unary-negative folding pass (fold_unary_negative)
currently leaves a NOP after consuming UNARY_NEGATIVE which prevents
fold_tuple_constants/fold_list_constants/fold_set_constants from recognizing
adjacent LOAD_CONSTs; change fold_unary_negative to emit a real LOAD_CONST with
the negated value (i.e., replace the original LOAD_CONST + UNARY_NEGATIVE
sequence with a single LOAD_CONST of the negative literal) instead of leaving a
NOP so later BUILD_* collection folds will see contiguous LOAD_CONSTs; apply the
same fix where the same pattern appears around the other occurrences noted
(lines ~671-677) so all collection-folding passes work correctly.

In `@crates/vm/src/frame.rs`:
- Around line 708-718: The code incorrectly seeds prev_line with
code.first_line_number when code.flags contains CodeFlags::GENERATOR or
CodeFlags::COROUTINE, which suppresses the first real LINE event for single-line
generator/coroutine bodies; instead, remove the priming of prev_line and
explicitly suppress the preamble opcodes (e.g., RETURN_GENERATOR and POP_TOP)
when emitting LINE events by checking the opcode during event emission rather
than setting prev_line based on code.first_line_number; update the
initialization around prev_line and the LINE-event emission logic to skip those
specific preamble opcodes while leaving prev_line at 0 so real first-line events
are not lost.

---

Outside diff comments:
In `@crates/codegen/src/symboltable.rs`:
- Around line 1830-1840: Generator expressions created via
scan_comprehension(..., true) never set the SymbolTable::is_generator flag;
update the code that creates/completes a comprehension (the scan_comprehension
path for genexpr) so that when its "is_generator" parameter is true it writes to
the current scope's flag (e.g., call tables.last_mut().unwrap().is_generator =
true inside scan_comprehension or copy the boolean into the new comprehension
scope), mirroring how the Expr::Yield/Expr::YieldFrom arms set is_generator in
scan_expression.

---

Nitpick comments:
In `@crates/codegen/src/compile.rs`:
- Around line 2758-2769: The comment for prepare_decorators is stale: it
mentions leaving NULL sentinels on the stack (e.g. [dec1, NULL, dec2, NULL]) but
the current implementation of prepare_decorators and apply_decorators uses a
plain [dec1, dec2, func] layout with CALL 0; update the doc comment above
prepare_decorators (and the related example in apply_decorators) to describe the
actual stack shape and transformation (e.g. push decorators in source order to
produce [dec1, dec2, func] and then CALL 0 repeatedly to produce
dec1(dec2(func))). Ensure you reference the functions prepare_decorators and
apply_decorators so the updated comment is next to the correct code.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

Run ID: ea6cba1a-c905-4bf2-831a-5edbac896928

📥 Commits

Reviewing files that changed from the base of the PR and between 82432be and 289172a.

⛔ Files ignored due to path filters (7)

Lib/test/test_compile.py is excluded by !Lib/**
Lib/test/test_peepholer.py is excluded by !Lib/**
Lib/test/test_sys_settrace.py is excluded by !Lib/**
crates/codegen/src/snapshots/rustpython_codegen__compile__tests__if_ands.snap is excluded by !**/*.snap
crates/codegen/src/snapshots/rustpython_codegen__compile__tests__if_mixed.snap is excluded by !**/*.snap
crates/codegen/src/snapshots/rustpython_codegen__compile__tests__if_ors.snap is excluded by !**/*.snap
crates/codegen/src/snapshots/rustpython_codegen__compile__tests__nested_double_async_with.snap is excluded by !**/*.snap

📒 Files selected for processing (5)

crates/codegen/src/compile.rs
crates/codegen/src/ir.rs
crates/codegen/src/symboltable.rs
crates/jit/src/instructions.rs
crates/vm/src/frame.rs

coderabbitai · 2026-03-25T07:01:46Z

crates/codegen/src/compile.rs

+        // For generators and async functions, emit RETURN_GENERATOR + POP_TOP before RESUME
+        let is_gen =
+            scope_type == CompilerScope::AsyncFunction || self.current_symbol_table().is_generator;
+        if is_gen {


⚠️ Potential issue | 🟠 Major

Propagate generator classification to the async return value check.

This preamble now knows a scope is a generator before body compilation, but the async-generator guard still keys off CodeFlags::GENERATOR later in compile_statement. async def f(): return 1; yield 2 will still compile because the return is visited before mark_generator() runs. Either set the generator flag here when self.current_symbol_table().is_generator is true, or switch the Line 2498 check to the symbol-table bit.

Possible fix

- let is_gen = - scope_type == CompilerScope::AsyncFunction || self.current_symbol_table().is_generator; - if is_gen { + let is_generator = self.current_symbol_table().is_generator; + let needs_return_generator = + scope_type == CompilerScope::AsyncFunction || is_generator; + if is_generator { + self.current_code_info().flags |= bytecode::CodeFlags::GENERATOR; + } + if needs_return_generator { emit!(self, Instruction::ReturnGenerator); emit!(self, Instruction::PopTop); }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

// For generators and async functions, emit RETURN_GENERATOR + POP_TOP before RESUME

let is_gen =

scope_type == CompilerScope::AsyncFunction || self.current_symbol_table().is_generator;

if is_gen {

// For generators and async functions, emit RETURN_GENERATOR + POP_TOP before RESUME

let is_generator = self.current_symbol_table().is_generator;

let needs_return_generator =

scope_type == CompilerScope::AsyncFunction || is_generator;

if is_generator {

self.current_code_info().flags |= bytecode::CodeFlags::GENERATOR;

}

if needs_return_generator {

emit!(self, Instruction::ReturnGenerator);

emit!(self, Instruction::PopTop);

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/codegen/src/compile.rs` around lines 1203 - 1206, The async-generator check is using CodeFlags::GENERATOR in compile_statement which runs before mark_generator(), so async defs that are generators slip through; when you compute is_gen (using scope_type == CompilerScope::AsyncFunction || self.current_symbol_table().is_generator) propagate that generator classification into the code flags immediately (or alternatively change the later check in compile_statement to consult self.current_symbol_table().is_generator instead of CodeFlags::GENERATOR). Concretely, set the generator bit on the current CodeFlags/state when is_gen is true (or update the Line 2498 check to use current_symbol_table().is_generator) so the async return-value guard sees the generator status before mark_generator() is called.

coderabbitai · 2026-03-25T07:01:46Z

crates/codegen/src/compile.rs

+    /// Collect attribute names assigned via `self.xxx = ...` in methods.
+    /// These are stored as __static_attributes__ in the class dict.
+    fn collect_static_attributes(body: &[ast::Stmt], attrs: Option<&mut IndexSet<String>>) {
+        let Some(attrs) = attrs else { return };
+        for stmt in body {
+            // Only scan def/async def at class body level
+            let (params, func_body) = match stmt {
+                ast::Stmt::FunctionDef(f) => (&f.parameters, &f.body),
+                _ => continue,
+            };
+            // Get first parameter name (usually "self" or "cls")
+            let first_param = params
+                .args
+                .first()
+                .or(params.posonlyargs.first())
+                .map(|p| &p.parameter.name);
+            let Some(self_name) = first_param else {
+                continue;
+            };
+            // Scan function body for self.xxx = ... (STORE_ATTR on first param)
+            Self::scan_store_attrs(func_body, self_name.as_str(), attrs);
+        }
+    }
+
+    /// Recursively scan statements for `name.attr = value` patterns.
+    fn scan_store_attrs(stmts: &[ast::Stmt], name: &str, attrs: &mut IndexSet<String>) {
+        for stmt in stmts {
+            match stmt {
+                ast::Stmt::Assign(a) => {
+                    for target in &a.targets {
+                        if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) = target
+                            && let ast::Expr::Name(n) = value.as_ref()
+                            && n.id.as_str() == name
+                        {
+                            attrs.insert(attr.to_string());
+                        }
+                    }
+                }
+                ast::Stmt::AnnAssign(a) => {
+                    if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =
+                        a.target.as_ref()
+                        && let ast::Expr::Name(n) = value.as_ref()
+                        && n.id.as_str() == name
+                    {
+                        attrs.insert(attr.to_string());
+                    }
+                }
+                ast::Stmt::AugAssign(a) => {
+                    if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =
+                        a.target.as_ref()
+                        && let ast::Expr::Name(n) = value.as_ref()
+                        && n.id.as_str() == name
+                    {
+                        attrs.insert(attr.to_string());
+                    }
+                }
+                ast::Stmt::If(s) => {
+                    Self::scan_store_attrs(&s.body, name, attrs);
+                    for clause in &s.elif_else_clauses {
+                        Self::scan_store_attrs(&clause.body, name, attrs);
+                    }
+                }
+                ast::Stmt::For(s) => {
+                    Self::scan_store_attrs(&s.body, name, attrs);
+                    Self::scan_store_attrs(&s.orelse, name, attrs);
+                }
+                ast::Stmt::While(s) => {
+                    Self::scan_store_attrs(&s.body, name, attrs);
+                    Self::scan_store_attrs(&s.orelse, name, attrs);
+                }
+                ast::Stmt::Try(s) => {
+                    Self::scan_store_attrs(&s.body, name, attrs);
+                    for handler in &s.handlers {
+                        let ast::ExceptHandler::ExceptHandler(h) = handler;
+                        Self::scan_store_attrs(&h.body, name, attrs);
+                    }
+                    Self::scan_store_attrs(&s.orelse, name, attrs);
+                    Self::scan_store_attrs(&s.finalbody, name, attrs);
+                }
+                ast::Stmt::With(s) => {
+                    Self::scan_store_attrs(&s.body, name, attrs);
+                }
+                _ => {}
+            }
+        }
+    }


⚠️ Potential issue | 🟠 Major

The static-attribute scan misses some valid method shapes.

Two gaps here produce incomplete __static_attributes__: Line 4524 picks args before posonlyargs, so def f(self, /, x): self.a = 1 is skipped, and scan_store_attrs never descends into match cases. Both break parity on valid class bodies.

Possible fix

@@ - let first_param = params - .args - .first() - .or(params.posonlyargs.first()) - .map(|p| &p.parameter.name); + let first_param = params + .posonlyargs + .first() + .or(params.args.first()) + .map(|p| &p.parameter.name); @@ ast::Stmt::With(s) => { Self::scan_store_attrs(&s.body, name, attrs); } + ast::Stmt::Match(s) => { + for case in &s.cases { + Self::scan_store_attrs(&case.body, name, attrs); + } + } _ => {}

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

/// Collect attribute names assigned via `self.xxx = ...` in methods.

/// These are stored as __static_attributes__ in the class dict.

fn collect_static_attributes(body: &[ast::Stmt], attrs: Option<&mut IndexSet<String>>) {

let Some(attrs) = attrs else { return };

for stmt in body {

// Only scan def/async def at class body level

let (params, func_body) = match stmt {

ast::Stmt::FunctionDef(f) => (&f.parameters, &f.body),

_ => continue,

};

// Get first parameter name (usually "self" or "cls")

let first_param = params

.args

.first()

.or(params.posonlyargs.first())

.map(|p| &p.parameter.name);

let Some(self_name) = first_param else {

continue;

};

// Scan function body for self.xxx = ... (STORE_ATTR on first param)

Self::scan_store_attrs(func_body, self_name.as_str(), attrs);

}

}

/// Recursively scan statements for `name.attr = value` patterns.

fn scan_store_attrs(stmts: &[ast::Stmt], name: &str, attrs: &mut IndexSet<String>) {

for stmt in stmts {

match stmt {

ast::Stmt::Assign(a) => {

for target in &a.targets {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) = target

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

}

ast::Stmt::AnnAssign(a) => {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =

a.target.as_ref()

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

ast::Stmt::AugAssign(a) => {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =

a.target.as_ref()

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

ast::Stmt::If(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

for clause in &s.elif_else_clauses {

Self::scan_store_attrs(&clause.body, name, attrs);

}

}

ast::Stmt::For(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

Self::scan_store_attrs(&s.orelse, name, attrs);

}

ast::Stmt::While(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

Self::scan_store_attrs(&s.orelse, name, attrs);

}

ast::Stmt::Try(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

for handler in &s.handlers {

let ast::ExceptHandler::ExceptHandler(h) = handler;

Self::scan_store_attrs(&h.body, name, attrs);

}

Self::scan_store_attrs(&s.orelse, name, attrs);

Self::scan_store_attrs(&s.finalbody, name, attrs);

}

ast::Stmt::With(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

}

_ => {}

}

}

}

/// Collect attribute names assigned via `self.xxx = ...` in methods.

/// These are stored as __static_attributes__ in the class dict.

fn collect_static_attributes(body: &[ast::Stmt], attrs: Option<&mut IndexSet<String>>) {

let Some(attrs) = attrs else { return };

for stmt in body {

// Only scan def/async def at class body level

let (params, func_body) = match stmt {

ast::Stmt::FunctionDef(f) => (&f.parameters, &f.body),

_ => continue,

};

// Get first parameter name (usually "self" or "cls")

let first_param = params

.posonlyargs

.first()

.or(params.args.first())

.map(|p| &p.parameter.name);

let Some(self_name) = first_param else {

continue;

};

// Scan function body for self.xxx = ... (STORE_ATTR on first param)

Self::scan_store_attrs(func_body, self_name.as_str(), attrs);

}

}

/// Recursively scan statements for `name.attr = value` patterns.

fn scan_store_attrs(stmts: &[ast::Stmt], name: &str, attrs: &mut IndexSet<String>) {

for stmt in stmts {

match stmt {

ast::Stmt::Assign(a) => {

for target in &a.targets {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) = target

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

}

ast::Stmt::AnnAssign(a) => {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =

a.target.as_ref()

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

ast::Stmt::AugAssign(a) => {

if let ast::Expr::Attribute(ast::ExprAttribute { value, attr, .. }) =

a.target.as_ref()

&& let ast::Expr::Name(n) = value.as_ref()

&& n.id.as_str() == name

{

attrs.insert(attr.to_string());

}

}

ast::Stmt::If(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

for clause in &s.elif_else_clauses {

Self::scan_store_attrs(&clause.body, name, attrs);

}

}

ast::Stmt::For(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

Self::scan_store_attrs(&s.orelse, name, attrs);

}

ast::Stmt::While(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

Self::scan_store_attrs(&s.orelse, name, attrs);

}

ast::Stmt::Try(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

for handler in &s.handlers {

let ast::ExceptHandler::ExceptHandler(h) = handler;

Self::scan_store_attrs(&h.body, name, attrs);

}

Self::scan_store_attrs(&s.orelse, name, attrs);

Self::scan_store_attrs(&s.finalbody, name, attrs);

}

ast::Stmt::With(s) => {

Self::scan_store_attrs(&s.body, name, attrs);

}

ast::Stmt::Match(s) => {

for case in &s.cases {

Self::scan_store_attrs(&case.body, name, attrs);

}

}

_ => {}

}

}

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/codegen/src/compile.rs` around lines 4512 - 4597, The collector misses methods with a pos-only first parameter and doesn't descend into match statements; update collect_static_attributes to pick the first param by checking params.posonlyargs.first() before params.args.first() (or otherwise consider both lists) when computing self_name in collect_static_attributes, and extend scan_store_attrs to handle the match statement AST node (iterate each match case/body and scan them) so assignments inside match arms are discovered; refer to the functions collect_static_attributes and scan_store_attrs to locate where to change the parameter selection and add the new match-arm recursion.

coderabbitai · 2026-03-25T07:01:46Z

crates/codegen/src/ir.rs

+        // Constant folding passes
+        self.fold_unary_negative();
        self.fold_tuple_constants();
+        self.fold_list_constants();
+        self.fold_set_constants();


⚠️ Potential issue | 🟠 Major

UNARY_NEGATIVE folding now blocks the later tuple/list/set folds.

This pass leaves the consumed UNARY_NEGATIVE as a NOP, but the tuple/list/set folds that run immediately afterwards still assume the last N instructions before BUILD_* are all constant loads. After a rewrite like [1, -2, 3] -> LOAD_CONST 1; LOAD_CONST -2; NOP; LOAD_CONST 3; BUILD_LIST 3, the collection passes stop matching, so negative-element literals no longer get folded.

Also applies to: 671-677

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/codegen/src/ir.rs` around lines 192 - 196, The unary-negative folding pass (fold_unary_negative) currently leaves a NOP after consuming UNARY_NEGATIVE which prevents fold_tuple_constants/fold_list_constants/fold_set_constants from recognizing adjacent LOAD_CONSTs; change fold_unary_negative to emit a real LOAD_CONST with the negated value (i.e., replace the original LOAD_CONST + UNARY_NEGATIVE sequence with a single LOAD_CONST of the negative literal) instead of leaving a NOP so later BUILD_* collection folds will see contiguous LOAD_CONSTs; apply the same fix where the same pattern appears around the other occurrences noted (lines ~671-677) so all collection-folding passes work correctly.

coderabbitai · 2026-03-25T07:01:46Z

crates/jit/src/instructions.rs

+            Instruction::StoreFastStoreFast { var_nums } => {
+                let oparg = var_nums.get(arg);
+                let (idx1, idx2) = oparg.indexes();
+                let val1 = self.stack.pop().ok_or(JitCompileError::BadBytecode)?;
+                self.store_variable(idx1, val1)?;
+                let val2 = self.stack.pop().ok_or(JitCompileError::BadBytecode)?;
+                self.store_variable(idx2, val2)


⚠️ Potential issue | 🟠 Major

This still doesn't match the VM's NULL-capable restore path.

In crates/vm/src/frame.rs:3446-3456, the interpreter handles this opcode with pop_value_opt() and stores Option<PyObjectRef> directly into fastlocals so NULL from LoadFastAndClear survives the restore sequence. This arm still routes both pops through store_variable(), which rejects JitValue::Null/JitValue::None, so the same valid bytecode will keep bailing out of JIT compilation.

coderabbitai · 2026-03-25T07:01:46Z

crates/vm/src/frame.rs

+        // For generators/coroutines, initialize prev_line to the def line
+        // so that preamble instructions (RETURN_GENERATOR, POP_TOP) don't
+        // fire spurious LINE events.
+        let prev_line = if code
+            .flags
+            .intersects(bytecode::CodeFlags::GENERATOR | bytecode::CodeFlags::COROUTINE)
+        {
+            code.first_line_number.map_or(0, |line| line.get() as u32)
+        } else {
+            0
+        };


⚠️ Potential issue | 🟡 Minor

Don't seed prev_line with first_line_number.

This also suppresses the first real LINE event for single-line generator/coroutine bodies (def g(): yield 1, async def f(): await x), because the body shares the same source line as the RETURN_GENERATOR / POP_TOP preamble. Please suppress the preamble opcodes explicitly instead of priming prev_line to that line.

Also applies to: 734-734

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/vm/src/frame.rs` around lines 708 - 718, The code incorrectly seeds prev_line with code.first_line_number when code.flags contains CodeFlags::GENERATOR or CodeFlags::COROUTINE, which suppresses the first real LINE event for single-line generator/coroutine bodies; instead, remove the priming of prev_line and explicitly suppress the preamble opcodes (e.g., RETURN_GENERATOR and POP_TOP) when emitting LINE events by checking the opcode during event emission rather than setting prev_line based on code.first_line_number; update the initialization around prev_line and the LINE-event emission logic to skip those specific preamble opcodes while leaving prev_line at 0 so real first-line events are not lost.

- Set is_generator flag for generator expressions in scan_comprehension - Fix posonlyargs priority in collect_static_attributes first param - Add match statement support to scan_store_attrs - Fix stale decorator stack comment - Reorder NOP removal after fold_unary_negative for better collection folding

@staticmethod

* Match CPython LOAD_SPECIAL stack semantics for with/async-with LOAD_SPECIAL now pushes (callable, self_or_null) matching CPython's CALL convention, instead of a single bound method: - Function descriptors: push (func, self) - Plain attributes: push (bound, NULL) Updated all with-statement paths: - Entry: add SWAP 3 after SWAP 2, remove PUSH_NULL before CALL 0 - Normal exit: remove PUSH_NULL before CALL 3 - Exception handler (WITH_EXCEPT_START): read exit_func at TOS-4 and self_or_null at TOS-3 - Suppress block: 3 POP_TOPs after POP_EXCEPT (was 2) - FBlock exit (preserve_tos): SWAP 3 + SWAP 2 rotation - UnwindAction::With: remove PUSH_NULL Stack effects updated: LoadSpecial (2,1), WithExceptStart (7,6) * Normalize LOAD_FAST_CHECK and JUMP_BACKWARD_NO_INTERRUPT Add LOAD_FAST_CHECK → LOAD_FAST and JUMP_BACKWARD_NO_INTERRUPT → JUMP_BACKWARD to opname normalization in dis_dump.py. These are optimization variants with identical semantics. * Add EXTENDED_ARG to SKIP_OPS, normalize LOAD_FAST_CHECK and JUMP_BACKWARD_NO_INTERRUPT * Remove duplicate return-None when block already has return Skip duplicate_end_returns for blocks that already end with LOAD_CONST + RETURN_VALUE. Run DCE + unreachable elimination after duplication to remove the now-unreachable original return block. * Improve __static_attributes__ collection accuracy - Support tuple/list unpacking targets: (self.x, self.y) = val - Skip @staticmethod and @classmethod decorated methods - Use scan_target_for_attrs helper for recursive target scanning * Use method mode for function-local import attribute calls Function-local imports (scope is Local+IMPORTED) should use method mode LOAD_ATTR like regular names, not plain mode. Only module/class scope imports use plain LOAD_ATTR + PUSH_NULL. * Optimize constant iterable before GET_ITER to LOAD_CONST tuple Convert BUILD_LIST/SET 0 + LOAD_CONST + LIST_EXTEND/SET_UPDATE + GET_ITER to just LOAD_CONST (tuple) + GET_ITER, matching CPython's optimization for constant list/set literals in for-loop iterables. Also fix is_name_imported to use method mode for function-local imports, and improve __static_attributes__ accuracy (skip @classmethod/@staticmethod, handle tuple/list unpacking targets). * Fix cell variable ordering: parameters first, then alphabetical CPython orders cell variables with parameter cells first (in parameter definition order), then non-parameter cells sorted alphabetically. Previously all cells were sorted alphabetically. Also add for-loop iterable optimization: constant BUILD_LIST/SET before GET_ITER is folded to just LOAD_CONST tuple. * Emit COPY_FREE_VARS before MAKE_CELL matching CPython order CPython emits COPY_FREE_VARS first, then MAKE_CELL instructions. Previously RustPython emitted them in reverse order. * Fix RESUME AfterYield encoding to match CPython 3.14 (value 5) CPython 3.14 uses RESUME arg=5 for after-yield, not 1. Also reorder COPY_FREE_VARS before MAKE_CELL and fix cell variable ordering (parameters first, then alphabetical). * Address code review feedback from #7481 - Set is_generator flag for generator expressions in scan_comprehension - Fix posonlyargs priority in collect_static_attributes first param - Add match statement support to scan_store_attrs - Fix stale decorator stack comment - Reorder NOP removal after fold_unary_negative for better collection folding * Fold constant list/set/tuple literals in compiler When all elements of a list/set/tuple literal are constants and there are 3+ elements, fold them into a single constant: - list: BUILD_LIST 0 + LOAD_CONST (tuple) + LIST_EXTEND 1 - set: BUILD_SET 0 + LOAD_CONST (tuple) + SET_UPDATE 1 - tuple: LOAD_CONST (tuple) This matches CPython's compiler optimization and fixes the most common bytecode difference (92/200 sampled files). Also add bytecode comparison scripts (dis_dump.py, compare_bytecode.py) for systematic parity tracking. * Use BUILD_MAP 0 + MAP_ADD for large dicts (>= 16 pairs) Match CPython's compiler behavior: dicts with 16+ key-value pairs use BUILD_MAP 0 followed by MAP_ADD for each pair, instead of pushing all keys/values on the stack and calling BUILD_MAP N. * Fix clippy warnings and cargo fmt * fix surrogate

youknowone added 5 commits March 25, 2026 11:58

Fix formatting and collapsible_if clippy warnings in compile.rs

6d4ed64

Fix JIT StoreFastStoreFast, format, and remove expectedFailure markers

970a9c9

- Add StoreFastStoreFast handling in JIT instructions - Fix cargo fmt in frame.rs - Remove 11 expectedFailure markers for async jump tests in test_sys_settrace that now pass

youknowone force-pushed the bytecode-parity-phase1 branch from 6adb4fb to 9f952c7 Compare March 25, 2026 02:58

youknowone added 2 commits March 25, 2026 12:20

youknowone marked this pull request as ready for review March 25, 2026 06:48

coderabbitai bot reviewed Mar 25, 2026

View reviewed changes

youknowone merged commit ea5a6cd into RustPython:main Mar 25, 2026
19 checks passed

youknowone deleted the bytecode-parity-phase1 branch March 25, 2026 07:10

coderabbitai bot mentioned this pull request Mar 25, 2026

Bytecode parity #7504

Merged

coderabbitai bot mentioned this pull request Mar 27, 2026

Bytecode parity phase 3 #7514

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bytecode parity#7481

Bytecode parity#7481
youknowone merged 7 commits intoRustPython:mainfrom
youknowone:bytecode-parity-phase1

youknowone commented Mar 23, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Mar 23, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (1 inconclusive)

Uh oh!

github-actions bot commented Mar 23, 2026 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Mar 25, 2026

Uh oh!

coderabbitai bot Mar 25, 2026

Uh oh!

coderabbitai bot Mar 25, 2026

Uh oh!

coderabbitai bot Mar 25, 2026

Uh oh!

coderabbitai bot Mar 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-        // For generators and async functions, emit RETURN_GENERATOR + POP_TOP before RESUME
-        let is_gen =
-            scope_type == CompilerScope::AsyncFunction || self.current_symbol_table().is_generator;
-        if is_gen {
+        // For generators and async functions, emit RETURN_GENERATOR + POP_TOP before RESUME
+        let is_generator = self.current_symbol_table().is_generator;
+        let needs_return_generator =
+            scope_type == CompilerScope::AsyncFunction || is_generator;
+        if is_generator {
+            self.current_code_info().flags |= bytecode::CodeFlags::GENERATOR;
+        }
+        if needs_return_generator {
+            emit!(self, Instruction::ReturnGenerator);
+            emit!(self, Instruction::PopTop);
+        }

Conversation

youknowone commented Mar 23, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (1 inconclusive)

Uh oh!

github-actions bot commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📦 Library Dependencies

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

youknowone commented Mar 23, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 23, 2026 •

edited

Loading

github-actions bot commented Mar 23, 2026 •

edited

Loading