python extract title tag from url and html using regex [ python ]

Posted in python

1:30 am, April 4, 2021

python extract title tag from url and html using regex

this will extract the title tag as text from the url and the title tag in the following python script

import re
from urllib.request import urlopen
url = "http://olympus.realpython.org/profiles/dionysus"
page = urlopen(url)
html = page.read().decode("utf-8")
pattern = "<title.*?>.*?</title.*?>"
match_results = re.search(pattern, html, re.IGNORECASE)
title = match_results.group()
title = re.sub("<.*?>", "", title) # Remove HTML tags
print(title)

Posted in python

1:30 am, April 4, 2021

python extract title tag from url and html using regex

python extract title tag from url and html using regex

python extract title tag from url and html using regex

python extract title tag from url and html using regex

Python

View Statistics

Add Comment

Other Items in python

Related Search Terms

Other Categories in Code

Search Code

Latest from Code

Welcome

Random Quote

Random CSS Property

:past

Latest Articles

Code

Links

Quick Links

Quick Links

Sections