Webscraping - iherb - ratings

JonasChan0414 · October 9, 2022, 7:06am

Hi all.
I’m trying to web scarp iherb but I’m having trouble extrating the data of the ratings (aka the number of stars of each product). Here’s my work:

import requests
from bs4 import BeautifulSoup as soup

url = ‘Whey Protein • Whey Protein Isolate and Concentrate | iHerb’
html = requests.get(url=url,headers=header_1)
html.status_code

bsobj = soup(html.content, ‘lxml’)

whey = bsobj.findAll(‘div’,{‘class’:‘product-inner product-inner-wide’})

for item in whey:
dict_1 = {
“title” : item.find(‘div’,{“class”:“product-title”}).text,
“link” : item.find(‘a’,{“class”:“absolute-link product-link”})[‘href’],
“reviews” : int(item.find(‘a’,{“class”:“rating-count”}).text),
“stars” : item.find(‘a’,{“class”:“stars”}).text
}

so the problem is I can’t get the number of stars out of 5 of each product. What should I do? Many thanks.

barry-scott · October 9, 2022, 7:23pm

Is the stars in the HTML that you fetched?
I suggest that you save the page in a file after your requests.get()
so that you can check where the stars are.

Often these days a web page is made on the by javascript code.
If so then you will not find the stars in the HTML.

You may need to use a tool like selenium to load the web page then you can query for the stars after the javascript has run.

Topic		Replies	Views
Loops on Python Python Help help	13	268	April 10, 2024
Problems with python var not displayed in html/java script Python Help	4	419	September 12, 2022
Programme pour tableau avancement chimie terminale Python Help help	1	967	November 1, 2021
New to python coding and need answers as example Python Help	3	309	November 21, 2023
How can i classify this data using Python? Python Help help	4	373	December 3, 2022

Webscraping - iherb - ratings

Related Topics