bpo-37751: Document the change in What's New in Python 3.9 #17997

vstinner · 2020-01-14T12:40:01Z

https://bugs.python.org/issue37751

vstinner · 2020-01-14T12:40:12Z

cc @malemburg

vstinner · 2020-01-14T12:48:41Z

Oops, the change was done in Python 3.9, not in Python 3.8! PR updated.

corona10

lgtm

serhiy-storchaka · 2020-01-14T14:58:48Z

Doc/whatsnew/3.9.rst

  :data:`~errno.EBADF` error.
  (Contributed by Victor Stinner in :issue:`39239`.)

+* :func:`codecs.lookup` now normalizes the encoding name the same way than


There are other differences. For example, normalize_encoding("КОИ-8") returns "кои_8", but codecs.lookup normalizes it to "8".

The comment in the sources is also not correct.

encodings.normalize_encoding() says "Note that encoding names should be ASCII only." You're correct: "КОИ-8" is normalized to "8" by codecs.lookup() because the C function _Py_normalize_encoding() ignores non-ASCII letters.

I don't know which behavior is correct. It sounds strange to me to have a non-ASCII encoding name. Which encoding is supposed to be used to encoding the encoding name?!? :-D Maybe encodings.normalize_encoding() should also ignore non-ASCII letters, be more strict.

I created bpo-39337: codecs.lookup() ignores non-ASCII characters, whereas encodings.normalize_encoding() copies them.

Maybe encodings.normalize_encoding() should also ignore non-ASCII letters, be more strict.

Hm, the annotation of normalize_encoding have the words: Note that encoding names should be ASCII only.
+1 for encodings.normalize_encoding() should be similar as _Py_normalize_encoding().
And I created a PR: #22219

There are other differences. For example, normalize_encoding("КОИ-8") returns "кои_8", but codecs.lookup normalizes it to "8".

After #22219 merged, this problem have been fixed(MAYBE enhanced will be more exact).

encukou · 2020-01-14T15:09:29Z

Doc/whatsnew/3.9.rst

  :data:`~errno.EBADF` error.
  (Contributed by Victor Stinner in :issue:`39239`.)

+* :func:`codecs.lookup` now normalizes the encoding name the same way than


Suggested change

* :func:`codecs.lookup` now normalizes the encoding name the same way than

* :func:`codecs.lookup` now normalizes the encoding name the same way as

Oh. I copied the NEWS entry from commit 20f59fe. If there is a typo, it should also be fixed in Misc/NEWS.d/next/Core and Builtins/2019-08-20-04-36-37.bpo-37751.CSFzUd.rst.

I create a new PR in #23096, and this word have been replaced.

csabella · 2020-05-22T22:13:33Z

@vstinner is this one that needs to be merged soon?

vstinner · 2020-10-01T16:23:19Z

@shihai1991: I failed to find time to finish this PR. Since you are wokring on bpo-39337, do you want to continue the work on this PR? You can steal it (copy/paste my change), and try to address previous comments.

shihai1991 · 2020-10-01T16:53:50Z

@shihai1991: I failed to find time to finish this PR. Since you are wokring on bpo-39337, do you want to continue the work on this PR? You can steal it (copy/paste my change), and try to address previous comments.

No problem, I will take a look :)

shihai1991 · 2020-11-02T05:06:35Z

A new PR in #23096 (copy from this PR).

vstinner added the skip news label Jan 14, 2020

the-knights-who-say-ni added the CLA signed label Jan 14, 2020

bedevere-bot added docs Documentation in the Doc dir awaiting core review labels Jan 14, 2020

vstinner added the needs backport to 3.8 label Jan 14, 2020

bpo-37751: Document the change in What's New in Python 3.9

7bfe9b2

vstinner changed the title ~~bpo-37751: Document the change in What's New in Python 3.8~~ bpo-37751: Document the change in What's New in Python 3.9 Jan 14, 2020

vstinner removed the needs backport to 3.8 label Jan 14, 2020

corona10 approved these changes Jan 14, 2020

View reviewed changes

serhiy-storchaka reviewed Jan 14, 2020

View reviewed changes

encukou reviewed Jan 14, 2020

View reviewed changes

vstinner closed this Jan 19, 2021

vstinner deleted the codecs_whatsnew38 branch January 19, 2021 21:59

	* :func:`codecs.lookup` now normalizes the encoding name the same way than
	* :func:`codecs.lookup` now normalizes the encoding name the same way as

Uh oh!

bpo-37751: Document the change in What's New in Python 3.9 #17997

bpo-37751: Document the change in What's New in Python 3.9 #17997

Uh oh!

Conversation

vstinner commented Jan 14, 2020 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vstinner commented Jan 14, 2020

Uh oh!

vstinner commented Jan 14, 2020

Uh oh!

corona10 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

csabella commented May 22, 2020

Uh oh!

vstinner commented Oct 1, 2020 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shihai1991 commented Oct 1, 2020

Uh oh!

shihai1991 commented Nov 2, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

vstinner commented Jan 14, 2020 •

edited by bedevere-bot

Loading

vstinner commented Oct 1, 2020 •

edited by bedevere-bot

Loading