Unexpected behavior with regular expressions

January 20, 2023

I am trying to write a parser that uses regular expressions to detect bibliography footnotes. One particular pattern is not working, and I cannot figure out why. Here is the code that isolates the problem:

import re
PATTERN = "[\\w ]+, [\\w ]+, (\\d+(\\-\\d+)?)\\."

match_A = re.search(PATTERN, "Author, Some Book, 51–66.")
match_B = re.search(PATTERN, "Author, Some Book, 60-61.")

print(match_A is not None)
print(match_B is not None)

SUB_PATTERN = "\\d+(\\-\\d+)?"

match_C = re.search(SUB_PATTERN, "51–66")
match_D = re.search(SUB_PATTERN, "60–61")

print(match_C is not None)
print(match_D is not None)

The result is:

False
True
True
True

But I expected all four results to be True.
Can anybody reproduce this issue, or explain what is happening?

I am working on Windows 10. My Python version:

Python 3.11.1 (tags/v3.11.1:a7a450f, Dec  6 2022, 19:58:39) [MSC v.1934 64 bit (AMD64)] on win32

>Solution :

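Your dashes are different: the first string contains "–" (an en dash, U+2013), while the second contains "-" (a hyphen-minus, U+002D). They look alike but are distinct characters, and your pattern only accepts the hyphen. If you don’t believe me, look up each one. The SUB_PATTERN checks return True either way because the dash group is optional, so re.search is satisfied by the leading digits alone. To accept both dashes, put them into a character class: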

PATTERN = "[\\w ]+, [\\w ]+, (\\d+([–-]\\d+)?)\\."
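To check this yourself, here is a minimal sketch (not part of the original answer) that prints the Unicode name of each dash and reruns both searches against the corrected pattern. It assumes the same test strings as the question. The raw-string spelling of the pattern and the variable name `sub` are my own choices for illustration.

```python
import re
import unicodedata

# Print the code point and official Unicode name of each dash character.
for ch in ("–", "-"):
    print(f"U+{ord(ch):04X}", unicodedata.name(ch))

# Raw-string spelling of the corrected pattern. A "-" placed last in a
# character class is taken literally, so [–-] means "en dash or hyphen".
PATTERN = r"[\w ]+, [\w ]+, (\d+([–-]\d+)?)\."

match_A = re.search(PATTERN, "Author, Some Book, 51–66.")
match_B = re.search(PATTERN, "Author, Some Book, 60-61.")
print(match_A is not None)
print(match_B is not None)

# Why the original SUB_PATTERN seemed to work: the dash group is optional,
# so the search succeeds on the leading digits and ignores the rest.
sub = re.search(r"\d+(\-\d+)?", "51–66")
print(sub.group(0))
```

Both searches now succeed, and the last line shows that the original sub-pattern only ever matched "51", not the whole page range. Printing `group(0)` instead of just testing for a match is a quick way to spot this kind of partial match.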

python-re

by MR

Published January 20, 2023
