web-scraping

Web scraping pipelines with requests and BeautifulSoup

For lightweight data collection, I prefer reliable HTML parsing over brittle browser automation. That means stable headers, polite rate limiting, retries, and explicit extraction rules. If scraping becomes core infrastructure, then I graduate it into