Conversation

@serhiy-storchaka (Member) commented Jan 1, 2018

@pitrou (Member) commented Jan 2, 2018

This doesn't look like it's really removing a peephole optimization. Instead, it moves a peephole optimization from the peepholer to the compiler, but the implementation looks similar. At least having the peepholer in a separate file means the compiler doesn't get more complicated by optimizations.

@brettcannon removed the request for review from a team on March 23, 2018 at 21:11
@markshannon (Member) commented:

The changes to importlib.h and importlib_external.h imply that this PR is behaviour-changing.
That they shrink suggests that this might be an improvement.

What are the changes? Are there branches that are now removed that weren't before?

@methane (Member) commented Dec 3, 2018

@markshannon This is diff of importlib dis: http://paste.ubuntu.com/p/4yn278YDrV/

This PR removes unused code after RETURN_VALUE, and merges two-step unconditional jumps into a single jump.
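One quick way to observe this class of dead code is to scan a function's instruction stream for anything that follows a return without being a jump target — a hypothetical check written with the `dis` module, not part of this PR:

```python
import dis

def f(x):
    if x:
        return 1
    else:
        return 2

# Walk consecutive instruction pairs and flag anything that follows a
# RETURN_VALUE/RETURN_CONST without being a jump target. On the older
# interpreters this PR targets, the peepholer could leave a dead
# JUMP_FORWARD in exactly that position.
instrs = list(dis.get_instructions(f))
dead = [
    ins.opname
    for prev, ins in zip(instrs, instrs[1:])
    if prev.opname.startswith("RETURN") and not ins.is_jump_target
]
print(dead)  # typically [] on modern CPython, which prunes such code
```

This only inspects one function; the paste linked above does the same comparison over the whole of importlib.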

@vstinner (Member) commented Dec 6, 2018

What's the status of this PR?

@serhiy-storchaka asked me to not fix compiler warnings with my PR #10652 before this PR is merged.

@pitrou doesn't seem to like the overall idea:

I'm not sure if @rhettinger likes the idea (but his comment is not really on the PR itself):

I understand that this change allows removing more dead code. Would it be possible to get the same effect by organizing the peephole optimizer differently? For example, would running the peephole optimizer twice also remove more dead code? I'm trying to understand whether removing more dead code can only be done by moving the jump optimization inside the compiler.

I have no opinion on this PR at this point :-)

@vstinner (Member) commented Dec 6, 2018

EDIT: _module_repr_from_spec() code was wrong, I fixed it.

I tested on this Python code:

def _module_repr_from_spec(spec):
    """Return the repr to use for the module."""
    # We mostly replicate _module_repr() using the spec attributes.
    name = '?' if spec.name is None else spec.name
    if spec.origin is None:
        if spec.loader is None:
            return '<module {!r}>'.format(name)
        else:
            return '<module {!r} ({!r})>'.format(name, spec.loader)
    else:
        if spec.has_location:
            return '<module {!r} from {!r}>'.format(name, spec.origin)
        else:
            return '<module {!r} ({})>'.format(spec.name, spec.origin)

I ran the peephole optimizer twice:

diff --git a/Python/compile.c b/Python/compile.c
index 45e78cb22c..7d361c6af0 100644
--- a/Python/compile.c
+++ b/Python/compile.c
@@ -5577,6 +5577,13 @@ makecode(struct compiler *c, struct assembler *a)
     if (!bytecode)
         goto error;
 
+    PyObject *bytecode2 = PyCode_Optimize(bytecode, consts, names, a->a_lnotab);
+    Py_CLEAR(bytecode);
+    bytecode = bytecode2;
+    if (!bytecode) {
+        goto error;
+    }
+
     tmp = PyList_AsTuple(consts); /* PyCode_New requires a tuple */
     if (!tmp)
         goto error;

The bytecode is different.

Before:

             60 CALL_METHOD              2
             62 RETURN_VALUE
             64 JUMP_FORWARD            36 (to 102)

 13     >>   66 LOAD_FAST                0 (spec)
             68 LOAD_ATTR                4 (has_location)
             70 POP_JUMP_IF_FALSE       86

After:

             60 CALL_METHOD              2
             62 RETURN_VALUE

 13     >>   64 LOAD_FAST                0 (spec)
             66 LOAD_ATTR                4 (has_location)
             68 POP_JUMP_IF_FALSE       84

=> the JUMP_FORWARD has been removed: it was dead code. As a side effect, offsets and jump targets are different. Example: LOAD_FAST moves from offset 66 to 64, and "POP_JUMP_IF_FALSE 86" becomes "POP_JUMP_IF_FALSE 84".

The question is now whether we can modify the peephole optimizer to get the same result when it is run only once, or whether the jump optimization has to move into the compiler.

@vstinner (Member) commented Dec 6, 2018

I may be wrong, but it seems that the optimizer doesn't remove basic blocks that are dead code.

I implemented such an optimization in my peephole optimizer (for Python bytecode), written in pure Python:

https://github.com/vstinner/bytecode/blob/aa2d29f018cfd4a5fc0dea5503eb280a5dfd37ab/bytecode/peephole_opt.py#L436-L454

peephole.c uses a hash table mapping offset => block index. My bytecode project uses additional "next_block" information, which the dead-code removal uses to follow the chain of blocks and check which basic blocks are dead code.

compile.c uses a different structure for basic blocks than peephole.c does:

typedef struct basicblock_ {
    ...
    /* If b_next is non-NULL, it is a pointer to the next
       block reached by normal control flow. */
    struct basicblock_ *b_next;
    ...
} basicblock;

Maybe this one can be used to detect unreachable basic blocks?

I would suggest leaving peephole.c as it is, but trying to remove unreachable basic blocks in compile.c.

What do you think @serhiy-storchaka?

@vstinner (Member) commented Dec 6, 2018

Note: the pure Python peephole optimizer has some documentation at https://bytecode.readthedocs.io/en/latest/peephole.html#optimizations

@vstinner (Member) commented Dec 7, 2018

I discussed with @serhiy-storchaka on IRC, he told me:

  • We need to optimize jumps before removing unreachable blocks, because jump optimization can create unreachable blocks.
  • We cannot remove some instructions that are used in frame_setlineno() to determine the end of logical blocks.

For the first point, I think that it's fine. We can organize PyCode_Optimize() to first optimize jumps, and only later remove unreachable blocks.
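The ordering in the first point can be sketched on a toy instruction list (purely illustrative, nothing from CPython): collapsing a jump-to-jump is exactly what turns the intermediate jump into dead code, so the unreachable-code pass only pays off if it runs afterwards.

```python
def retarget(code):
    """Collapse jump-to-jump: point a JUMP at its final destination.
    (No cycle detection; fine for this toy example.)"""
    by_label = {label: (label, op, tgt) for label, op, tgt in code}
    out = []
    for label, op, tgt in code:
        while op == "JUMP" and by_label[tgt][1] == "JUMP":
            tgt = by_label[tgt][2]
        out.append((label, op, tgt))
    return out

def reachable(code):
    """Labels reachable from the first instruction."""
    index = {label: i for i, (label, _, _) in enumerate(code)}
    seen, stack = set(), [code[0][0]]
    while stack:
        label = stack.pop()
        if label in seen:
            continue
        seen.add(label)
        i = index[label]
        _, op, tgt = code[i]
        if tgt is not None:
            stack.append(tgt)           # jump edge
        if op not in ("JUMP", "RETURN") and i + 1 < len(code):
            stack.append(code[i + 1][0])  # fall-through edge
    return seen

code = [
    ("L0", "JUMP", "L2"),    # jumps to another jump
    ("L1", "NOP", None),     # already dead (after an unconditional jump)
    ("L2", "JUMP", "L4"),    # becomes dead once L0 is retargeted
    ("L3", "NOP", None),
    ("L4", "RETURN", None),
]

print(sorted(reachable(code)))            # before: ['L0', 'L2', 'L4']
print(sorted(reachable(retarget(code))))  # after:  ['L0', 'L4']
```

Running dead-code elimination before retargeting would keep L2 alive; running it after removes one more block, which is the effect Serhiy describes.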

For the second point, maybe that's a reference to this comment in peephole.c?

                /* END_FINALLY should be kept since it denotes the end of
                   the 'finally' block in frame_setlineno() in frameobject.c.
                   SETUP_FINALLY should be kept for balancing.
                 */
                while (h < codelen && ISBASICBLOCK(blocks, i, h) &&
                       _Py_OPCODE(codestr[h]) != END_FINALLY)
                { ... }

@markshannon (Member) commented:

I'm closing this, as it is obsolete.

@markshannon markshannon closed this Mar 2, 2021

Labels: awaiting merge, performance (Performance or resource usage), skip news