07. Regular Expressions

Regular Expressions

A RegEx, or Regular Expression is a sequence of characters that forms a search pattern. RegEx can be used to check if a string contains the specified search pattern.

Python has a built-in package called re, which can be used to work with Regular Expressions. Import the re module:

When you have imported the re module, you can start using regular expressions:

For Example:
import re

#Check if the string starts with "The" and ends with "Spain":

TXT = "The rain in Spain"
x = re.search("The rain", TXT )

if (x):
     print("YES! We have a match!")
else:
    print("No match")

Output:
Yes! We have a match

Regular Expression Functions

findall()

The findall() function returns a list containing all matches.

For Example :

import re
str = "The rain in Spain"   #Return a list containing every occurrence of "ai":


x = re.findall("ai", str)

print(x)

Output

[‘ai’, ‘ai’]

search()

The search() function searches the string for a match, and returns a match object if there is a match. If there is more than one match, only the first occurrence of the match will be returned:

For Example :

import re
TXT = "The rain in Spain"

x = re.search('world', txt)

if x:
     print("pattern found inside the string")
else:
     print("pattern not found")

Output :
pattern not found

split()

The split() function returns a list where the string has been split at each match

For Example :

import re
TXT = "The rain in Spain"
x = re.split("\s", TXT)
print(x)

Output:
['The', 'rain', 'in', 'Spain']

sub()

The sub() function replaces the matches with the text of your choice

For Example :

import re
str = "The rain in Spain"
x = re.sub("\s", ",", str)
print(x)

Output:
The,rain,in,Spain