short courses · Self-paced
Structured Web Scraping
Ethical scraping with rate limits, parsing pipelines, and storage hygiene.
- Duration
- 3 weeks
- Level
- Intermediate
- Schedule
- Flexible
- Price
- ₩240,000
Overview
Build parsers that respect robots.txt and site terms. Focus on BeautifulSoup and httpx with caching, not brute-force crawling at scale.
What is included
- —httpx async fetch patterns
- —BeautifulSoup parsing strategies
- —Rate limiting and backoff
- —Storage to Parquet and CSV
- —Ethics and terms review checklist
Outcomes
- →Ship a parser with documented rate limits
- →Handle pagination without hammering hosts
- →Store outputs with schema metadata
Sora Kim
Curriculum designer specializing in API integration literacy.
FAQ
We teach ethics frameworks; we do not provide legal advice on specific sites.
Reviews
"Anonymous — rate-limit module prevented us from getting blocked during pilot."
"Pagination lab was the clearest explanation I have seen."