Regexp python - finding substring

Question

How could I find all instances of a substring in a string?

For example I have the string ("%1 is going to the %2 with %3"). I need to extract all placeholders in this string (%1, %2, %3)

The current code could only find the first two because the ending is not a white space.

import re
string = "%1 is going to the %2 with %3"


r = re.compile('%(.*?) ')
m = r.finditer(string)
for y in m:
 print (y.group())

Martijn Pieters · Accepted Answer · 2013-05-08 19:37:06Z

5

Don't match on whitespace, match on a word boundary instead using \b:

r = re.compile(r'%(.*?)\b')

You may want to restrict your characters to word characters only instead of the . wildcard, and match at least one character:

r = re.compile(r'%(\w+)\b')

You don't appear to be using the capturing group either, so you could just omit that:

r = re.compile(r'%\w+\b')

edited May 8, 2013 at 19:37

answered May 8, 2013 at 19:24

Martijn Pieters

1.1m326 gold badges4.2k silver badges3.4k bronze badges

Sign up to request clarification or add additional context in comments.

8 Comments

smilelife Over a year ago

Thanks Martijn, what if the placeholder was to change from %1 to %1%, how could I extract that entire substring?

Martijn Pieters Over a year ago

Then you can just match on the second %: r'%(\w+)%'.

smilelife Over a year ago

What if that is an unknown character, it could be a %, $ or even %%?

Alfe Over a year ago

Only word-constituents would pose a problem (matching \w) for obvious reasons.

Martijn Pieters Over a year ago

@user1322582: what are you trying to do? You can create a dynamic regular expression, or you can use a character class to match the possible characters: r'(?P<meta>(%|%%|$|$$)\w+(?P=meta)' would match one of 4 different meta characters, provided that the same meta character(s) are used after the name as well.

|

Collectives™ on Stack Overflow

Regexp python - finding substring

1 Answer 1

8 Comments

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

8 Comments

Related