bpo-42218: Check for the error indicator when expecting a new token #23058

isidentical · 2020-10-31T12:06:43Z

https://bugs.python.org/issue42218

serhiy-storchaka · 2020-10-31T13:31:08Z

Parser/pegen.c

 {
    Token *t = _PyPegen_expect_token(p, NAME);
-    if (t == NULL) {
+    if (t == NULL || p->error_indicator) {


How can _PyPegen_expect_token() return non-NULL if error is set?

Either _PyPegen_name_token() is called when error is set (it should not, it should be checked earlier), or the problem is in _PyPegen_fill_token().

Add assert(!PyErr_Occurred()) at the top of _PyPegen_expect_token() and if it does not crash, search bug in _PyPegen_fill_token().

How can _PyPegen_expect_token() return non-NULL if error is set?

As far as I understand, _PyPegen_expect_token returns NULL on 2 different cases. If there is an error, it returns NULL and if the given type is not matched (e.g: NAME given but NUMBER is the next token) it also returns NULL. But on the latter, it returns NULL without setting an error. So when parser is trying alternatives and backtracking, it assumes it got a different type of token and continues.

One thing that I thought was adding

if (p->error_indicator) { return NULL; }

at the begging of _PyPegen_expect_token but I failed to find another case where the error didn't propagate except the one that author suggested so this might be enough.

serhiy-storchaka · 2020-10-31T13:39:30Z

Lib/test/test_syntax.py

        self._check_error(code, "invalid syntax")

+    def test_unexpected_line_continuation(self):
+        self._check_error('A.\u018a\\ ', "unexpected character after line continuation character")


I would test two cases: \\\n and \\ followed by non-\n character. They are two slightly different errors.

Instead \u018a you can use any non-ascii character.

lysnikolaou · 2020-10-31T16:57:11Z

I think that this is too specialized a fix for this specific situation. I've investigated a bit more in depth and I think I found a problem there is with left-recursive rules, not handling p->error_indicator correctly. I've got a patch ready. @isidentical Would it be okay to close this PR?

isidentical · 2020-10-31T17:21:57Z

Sure!

bpo-42218: Check for the error indicator when expecting a new token

e911f2d

isidentical requested review from lysnikolaou and pablogsal as code owners October 31, 2020 12:06

the-knights-who-say-ni added the CLA signed label Oct 31, 2020

bedevere-bot added the awaiting review label Oct 31, 2020

isidentical marked this pull request as draft October 31, 2020 12:07

Zac-HD mentioned this pull request Oct 31, 2020

New failure on Python3.9 Zac-HD/hypothesmith#12

Closed

serhiy-storchaka self-requested a review October 31, 2020 13:07

serhiy-storchaka reviewed Oct 31, 2020

View reviewed changes

isidentical closed this Oct 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-42218: Check for the error indicator when expecting a new token #23058

bpo-42218: Check for the error indicator when expecting a new token #23058

Uh oh!

isidentical commented Oct 31, 2020 •

edited by bedevere-bot

Loading

Uh oh!

serhiy-storchaka Oct 31, 2020

Uh oh!

isidentical Oct 31, 2020 •

edited

Loading

Uh oh!

serhiy-storchaka Oct 31, 2020

Uh oh!

lysnikolaou commented Oct 31, 2020

Uh oh!

isidentical commented Oct 31, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

bpo-42218: Check for the error indicator when expecting a new token #23058

bpo-42218: Check for the error indicator when expecting a new token #23058

Uh oh!

Conversation

isidentical commented Oct 31, 2020 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

serhiy-storchaka Oct 31, 2020

Choose a reason for hiding this comment

Uh oh!

isidentical Oct 31, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka Oct 31, 2020

Choose a reason for hiding this comment

Uh oh!

lysnikolaou commented Oct 31, 2020

Uh oh!

isidentical commented Oct 31, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

isidentical commented Oct 31, 2020 •

edited by bedevere-bot

Loading

isidentical Oct 31, 2020 •

edited

Loading