Updates to deny_list recognizer #858

Merged
omri374 merged 5 commits into main from omri/deny_list_separator
May 2, 2022
@omri374 omri374 commented May 1, 2022

Collaborator

Change Description

The existing deny_list functionality has several limitations:

  1. Terms are not detected unless they have whitespace both before and after them.
  2. The deny list confidence level is not configurable.
  3. Special characters within deny list terms are not escaped, so terms containing them are currently not detected.

This PR addresses (1) by modifying the deny_list regex, (2) by adding a parameter, and (3) by updating the regex construction. Due to a newer version of black, some additional files were also changed.
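
To illustrate the three changes, here is a minimal sketch using only Python's standard `re` module. It is not the code from this PR: the function names `deny_list_to_regex` and `find_terms`, the `score` parameter, and the exact boundary pattern are assumptions chosen for illustration. Each term is escaped so special characters match literally, the term must be delimited by a non-word character or the start/end of the text rather than by whitespace only, and the confidence score is passed in by the caller.

```python
import re


def deny_list_to_regex(deny_list):
    """Build one regex that matches any term in deny_list (hypothetical helper)."""
    # Escape each term so characters such as "." or "+" are matched literally.
    escaped = [re.escape(term) for term in deny_list]
    alternation = "|".join(escaped)
    # A term must be preceded by the start of the text or a non-word character,
    # and followed by a non-word character or the end of the text.
    return r"(?:^|(?<=\W))(" + alternation + r")(?:(?=\W)|$)"


def find_terms(text, deny_list, score=1.0):
    """Return (term, start, end, score) for every deny-list hit in text."""
    pattern = re.compile(deny_list_to_regex(deny_list), re.IGNORECASE)
    return [
        (m.group(1), m.start(1), m.end(1), score)
        for m in pattern.finditer(text)
    ]
```

With this sketch, `find_terms("Hello Mr.,Smith", ["Mr.", "Smith"])` finds both terms even though a comma rather than whitespace separates them, while `find_terms("Smithson", ["Smith"])` finds nothing because the term is embedded in a longer word.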

Issue reference

This PR fixes issue #857

Checklist

  • I have reviewed the contribution guidelines
  • I have signed the CLA
  • My code includes unit tests
  • All unit tests and lint checks pass locally
  • My PR contains documentation updates / additions if required

@omri374 omri374 requested a review from shiranr May 2, 2022 07:55
@omri374 omri374 merged commit 8c661fe into main May 2, 2022
@omri374 omri374 deleted the omri/deny_list_separator branch May 2, 2022 14:05
prokopidis pushed a commit to prokopidis/presidio that referenced this pull request Jun 23, 2026
Development

Successfully merging this pull request may close these issues.

Deny list recognizer fails to detect terms that are not surrounded by whitespace

3 participants