# Import library
import requests
STEP 3. SELECTING A PAGE
In this project, we will use webscraper.io . Since this website is built in HTML, the code is easier and more understandable even for beginners. We chose this page to scrape the data:
Webscrapper.io
It is a prototype of an online store website. We will parse data about computers and laptops, such as product name, price, description and reviews.
STEP 4. REQUEST FOR PERMISSION
Once we select a page, we copy its URL and use ecuador telegram database
request to ask the server for permission to retrieve data from their site.
# Define URL
url = ‘https://webscraper.io/test-sites/e-comm ... rs/laptops'#
Ask hosting server to fetch url
requests.get(url)
The <Response[200]> result means that the server allows us to collect data from their website. We can use the request.get function to check.
pages = requests.get(url)
pages.text
When you run this code, you will get a jumbled text output that is not suitable for Python. We need to use a parser to make it more readable.
# parser-lxml = Change html to Python friendly format
soup = BeautifulSoup(pages.text, ‘lxml’)
soup
From bs4 import BeautifulSoup
-
- Posts: 783
- Joined: Thu Jan 02, 2025 7:44 am