Web Scraping with Beautiful Soup
Object Types

BeautifulSoup breaks the HTML page into several types of objects.

Tags

A Tag object corresponds to an HTML tag in the original document. These lines of code:

from bs4 import BeautifulSoup

soup = BeautifulSoup('<div id="example">An example div</div><p>An example p tag</p>', 'html.parser')
print(soup.div)

Would produce output that looks like:

<div id="example">An example div</div>

Accessing a tag from the BeautifulSoup object in this way will get the first tag of that type on the page.
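To make the "first tag of that type" behavior concrete, here is a minimal sketch. The two-paragraph markup is invented for illustration, and the explicit 'html.parser' argument is an assumption about which parser to use.

```python
from bs4 import BeautifulSoup

# Hypothetical markup with two <p> tags, invented for illustration.
html = '<p>First paragraph</p><p>Second paragraph</p>'
soup = BeautifulSoup(html, 'html.parser')

first_p = soup.p              # attribute access returns only the first <p>
all_p = soup.find_all('p')    # find_all returns every <p>, in document order
missing = soup.table          # a tag type that is not on the page gives None

print(first_p)
print(len(all_p))
print(missing)
```

If you need every tag of a type rather than just the first, find_all is the usual choice.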

You can get the name of the tag using .name and a dictionary representing the attributes of the tag using .attrs:

print(soup.div.name)
print(soup.div.attrs)

Output:

div
{'id': 'example'}
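Because .attrs is an ordinary dictionary, a single attribute can also be read directly from the tag. The sketch below reuses the example div from this lesson. The 'html.parser' argument and the lookup of a missing class attribute are assumptions added for illustration.

```python
from bs4 import BeautifulSoup

soup = BeautifulSoup('<div id="example">An example div</div>', 'html.parser')
div = soup.div

# Indexing the tag looks the key up in div.attrs.
div_id = div['id']

# .get() returns None instead of raising KeyError when the attribute is absent.
div_class = div.get('class')

print(div_id)
print(div_class)
```

Using .get() is the safer option when you are not sure every tag on a page carries the attribute.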

NavigableStrings

NavigableStrings are the pieces of text inside the HTML tags on the page. You can get the string inside a tag by accessing its .string attribute:

print(soup.div.string)

Output:

An example div
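The sketch below shows what kind of object .string returns, and one case where it returns nothing. The p tag with nested bold text is invented for illustration, and 'html.parser' is an assumed parser choice.

```python
from bs4 import BeautifulSoup
from bs4.element import NavigableString

# The <p> with a nested <b> is hypothetical, added to show the None case.
html = '<div id="example">An example div</div><p>Some <b>bold</b> text</p>'
soup = BeautifulSoup(html, 'html.parser')

div_text = soup.div.string
print(type(div_text).__name__)   # the value is a NavigableString, a str subclass

# A tag with more than one child has no single string, so .string is None.
p_text = soup.p.string
print(p_text)

# get_text() joins all the text inside the tag instead.
p_all_text = soup.p.get_text()
print(p_all_text)
```

When a tag mixes text and nested tags, get_text() is usually the more dependable way to read its contents.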

Instructions

1. Print out the first p tag on the shellter.html page.

2. Print out the string associated with the first p tag on the shellter.html page.
