KashmalaJamshaid/Web-scraping-using-python-and-beautifulsoup
This notebook scrapes data from web pages using BeautifulSoup and Selenium. It takes a website URL as input and extracts the following information from that page:
1. Specific HTML tags, along with the title and meta description
2. Heading tags from h1 to h6, along with the title and meta description
3. ALT text of images
4. The number of words on the page
5. Broken links on the page
6. The source code of the page
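The extraction steps listed in the description can be sketched roughly as follows. This is not the notebook's code: to stay dependency-free it swaps in the standard library's html.parser for BeautifulSoup, and the names PageSummary, summarize and is_broken are hypothetical. It covers the title, meta description, headings, ALT text, word count and a simple broken-link probe; the page source itself is just whatever HTML string is passed in.

```python
import http.client
import re
import urllib.request
from html.parser import HTMLParser

HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
SKIP = {"script", "style"}  # text inside these is not visible page text


class PageSummary(HTMLParser):
    """Hypothetical helper: collects the items listed in the description."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta_description = None
        self.headings = []   # list of (tag, text) pairs
        self.alt_texts = []
        self.links = []
        self.words = 0
        self._stack = []     # currently open title/heading/script/style tags

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.meta_description = attrs.get("content")
        elif tag == "img" and attrs.get("alt") is not None:
            self.alt_texts.append(attrs["alt"])
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        if tag == "title" or tag in HEADINGS or tag in SKIP:
            self._stack.append(tag)
            if tag in HEADINGS:
                self.headings.append((tag, ""))

    def handle_endtag(self, tag):
        if self._stack and self._stack[-1] == tag:
            self._stack.pop()

    def handle_data(self, data):
        current = self._stack[-1] if self._stack else None
        if current in SKIP:
            return
        if current == "title":
            self.title += data
            return
        if current in HEADINGS:
            name, text = self.headings[-1]
            self.headings[-1] = (name, text + data)
        # Count words in visible text (headings included, title excluded).
        self.words += len(re.findall(r"\w+", data))


def summarize(html):
    """Parse an HTML string and return the collected items as a dict."""
    parser = PageSummary()
    parser.feed(html)
    parser.close()
    return {
        "title": parser.title.strip(),
        "meta_description": parser.meta_description,
        "headings": [(tag, text.strip()) for tag, text in parser.headings],
        "alt_texts": parser.alt_texts,
        "links": parser.links,
        "word_count": parser.words,
    }


def is_broken(url, timeout=10):
    """Treat a link as broken if a HEAD request fails or the URL is invalid.

    Assumption: some servers reject HEAD, so a real checker may need to
    fall back to GET before declaring a link broken.
    """
    try:
        request = urllib.request.Request(url, method="HEAD")
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status >= 400
    except (OSError, ValueError, http.client.HTTPException):
        return True
```

Relative links such as `/about` would need to be resolved against the page URL (for example with `urllib.parse.urljoin`) before being passed to `is_broken`. Pages that build their content with JavaScript are presumably where Selenium comes in, since a plain HTML parser only sees the markup it is given.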
No commits in the last 6 months.
Stars: 10
Forks: 9
Language: Jupyter Notebook
License: -
Category:
Last pushed: Aug 04, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/KashmalaJamshaid/Web-scraping-using-python-and-beautifulsoup"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
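The same request can be made from Python. The sketch below only builds the URL shown in the curl command and returns the raw response body; the response format is not documented here, so nothing is assumed about it, and the names quality_url and fetch_quality are hypothetical.

```python
import urllib.parse
import urllib.request

# Base path taken from the curl command above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/perception/"


def quality_url(owner, repo):
    """Build the endpoint URL for an owner/repo pair."""
    return (BASE + urllib.parse.quote(owner, safe="")
            + "/" + urllib.parse.quote(repo, safe=""))


def fetch_quality(owner, repo, timeout=10):
    """Return the raw response body as text (format not assumed)."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

For example, `fetch_quality("KashmalaJamshaid", "Web-scraping-using-python-and-beautifulsoup")` issues the same GET request as the curl command, subject to the daily request limit.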
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
soxoj/maigret
🕵️‍♂️ Collect a dossier on a person by username from 3000+ sites
0x676e67/wreq-python
An ergonomic Python HTTP Client with TLS fingerprint