Help! finding the content of tags

lalala12345 · December 22, 2021, 4:58pm

Hi! I need to write a programme that use urllib to read the HTML from a file, and parse the data, extracting numbers and compute the sum of the numbers in the file. I have to to find all the span tags in the file and pull out the numbers from the tag and sum the numbers.
I’ve written this:
from urllib.request import urlopen
from bs4 import BeautifulSoup

url = input('Enter - ')
html = urlopen(url, context=ctx).read()
soup = BeautifulSoup(html, “html.parser”)

tags = soup(‘span’)
for tag in tags:

sum = sum+int(tag.contents[0])

print(sum)

But when I run the programme it appears:

Traceback (most recent call last):
File “C:\Users\Izan\Documents\folder2\html2.py”, line 6, in
from bs4 import BeautifulSoup
File “C:\Users\Izan\Documents\folder2\bs4_init_.py”, line 30, in
from .builder import builder_registry, ParserRejectedMarkup
File “C:\Users\Izan\Documents\folder2\bs4\builder_init_.py”, line 4, in
from bs4.element import (
File “C:\Users\Izan\Documents\folder2\bs4\element.py”, line 8, in
from bs4.dammit import EntitySubstitution
File “C:\Users\Izan\Documents\folder2\bs4\dammit.py”, line 13, in
from html.entities import codepoint2name
ModuleNotFoundError: No module named ‘html.entities’; ‘html’ is not a package

Any ideas?

lalala12345 · December 22, 2021, 5:49pm

I’ve tried it this way and it does not work either:
import urllib.request, urllib.parse, urllib.error
from bs4 import BeautifulSoup
import ssl
numlist =

fhand = urllib.request.urlopen(‘http://…’)
html = urllib.request.urlopen(url, context=ctx).read()
soup = BeautifulSoup(html, ‘html.parser’)

tags = soup(‘span’)
for tag in tags:

numlist.append(int(tag))

print (sum(numlist) )

Topic		Replies	Views
Using phyton to analyse html Python Help	0	323	December 19, 2021
Newbie Here looking for code assistance Python Help help	2	313	July 8, 2023
Need Help with Web Scraping Python Help help	1	346	July 4, 2023
Python Script Not Parsing Website Data Correctly Python Help	6	780	February 17, 2024
Parsing HTML with the XML module Python Help	5	879	November 17, 2021

Help! finding the content of tags

Related Topics