EDGAR Python Web Scraper
This repository contains Gary Pang's Python web scraper, which parses fund holdings pulled from the SEC's EDGAR website and writes the data to a .tsv file.
Requirements
Getting Started
- pip install -r requirements.txt (or pipenv install if you are using pipenv)
- python scraper.py (or pipenv run python scraper.py)
- When prompted, enter the 10-digit CIK number of a mutual fund
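The prompt expects a 10-digit CIK, but CIKs are often quoted without their leading zeros. As a minimal sketch of how that input could be checked and padded before use, the helper below (normalize_cik is a hypothetical name, not a function confirmed to exist in scraper.py) accepts 1 to 10 digits and zero-pads the result:

```python
import re


def normalize_cik(raw):
    """Return a 10-digit, zero-padded CIK string.

    Hypothetical helper for illustration; scraper.py may validate
    its input differently or not at all.
    """
    cik = raw.strip()
    # A CIK is purely numeric and at most 10 digits long.
    if not re.fullmatch(r"\d{1,10}", cik):
        raise ValueError(f"not a valid CIK: {raw!r}")
    return cik.zfill(10)


if __name__ == "__main__":
    print(normalize_cik(input("Enter a CIK: ")))
```

Rejecting bad input up front avoids sending a request to EDGAR that cannot match any filer.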
Key Dependencies
- Requests, Python library for making HTTP requests
- lxml, Python library for processing XML and HTML
- Beautiful Soup, Python library for scraping information from Web pages
- re, Python module for using regular expressions
- csv, Python module for parsing and writing CSV and TSV files
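To illustrate how the two standard-library modules in this list might fit together, here is a rough sketch that pulls a few fields out of holdings markup with re and writes them as tab-separated text with csv. It is not the code in scraper.py: the sample markup is made up, the field names are assumed to resemble a 13F information table, and plain regular expressions stand in here for the Beautiful Soup and lxml parsing that the project lists as dependencies.

```python
import csv
import io
import re

# Made-up markup for illustration only; not real filing data.
SAMPLE = """
<infoTable><nameOfIssuer>EXAMPLE CORP</nameOfIssuer><cusip>000000000</cusip><value>100</value></infoTable>
<infoTable><nameOfIssuer>SAMPLE INC</nameOfIssuer><cusip>111111111</cusip><value>250</value></infoTable>
"""

# Assumed field names; the real scraper may extract different columns.
FIELDS = ["nameOfIssuer", "cusip", "value"]


def extract_rows(text):
    """Return one list of field values per <infoTable> block."""
    rows = []
    for block in re.findall(r"<infoTable>(.*?)</infoTable>", text, flags=re.S):
        row = []
        for field in FIELDS:
            match = re.search(rf"<{field}>(.*?)</{field}>", block, flags=re.S)
            # Missing fields become empty cells rather than raising.
            row.append(match.group(1).strip() if match else "")
        rows.append(row)
    return rows


def to_tsv(rows):
    """Serialize a header plus rows as tab-separated text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(FIELDS)
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_tsv(extract_rows(SAMPLE)), end="")
```

Passing delimiter="\t" to csv.writer is what turns the usual CSV output into TSV; writing to an open file instead of a StringIO buffer works the same way.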
Contributor