riho
New member
I want to write a PHP script that parses SEC filings from www.sec.gov.
The problem is that each company uses a different structure and layout for its filings. Some are in HTML and some are in plain text.
However, some keywords in the text are always the same, such as "Net Income" and "Total current assets".
Here are some sample links:
http://www.sec.gov/Archives/edgar/data/40730/000095012407001502/0000950124-07-001502.txt
http://www.sec.gov/Archives/edgar/data/1050797/0000893877-99-000199.txt
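As a rough starting point, the keyword idea could be sketched like this. It is only a sketch under some assumptions: the filing has already been downloaded into a string, the figure sits on the same line as the keyword, and the first number after the keyword is the one wanted. The function names `filingToLines` and `findValue` are made up for illustration.

```php
<?php
// Hypothetical helper: reduce a filing (HTML or plain text) to an array of text lines.
function filingToLines(string $raw): array
{
    // Remove markup and decode entities; plain-text filings pass through mostly unchanged.
    $text = html_entity_decode(strip_tags($raw), ENT_QUOTES, 'UTF-8');
    return preg_split('/\R/', $text);
}

// Hypothetical helper: return the first number after $keyword on the same line, or null.
// Assumption: a value written in parentheses, e.g. (56), is treated as negative.
function findValue(array $lines, string $keyword): ?float
{
    foreach ($lines as $line) {
        $pos = stripos($line, $keyword); // case-insensitive keyword search
        if ($pos === false) {
            continue;
        }
        $rest = substr($line, $pos + strlen($keyword));
        // Optional "(", optional "$", then digits with thousands separators.
        if (preg_match('/(\()?\s*\$?\s*(\d[\d,]*(?:\.\d+)?)/', $rest, $m)) {
            $value = (float) str_replace(',', '', $m[2]);
            return $m[1] === '(' ? -$value : $value;
        }
    }
    return null;
}

// Made-up sample text, not taken from a real filing.
$sample = "<TABLE>\nTotal current assets      $ 1,234     1,100\nNet income (loss)   (56)   78\n</TABLE>";
$lines = filingToLines($sample);
var_dump(findValue($lines, 'Total current assets'));
var_dump(findValue($lines, 'Net Income'));
```

This will not cope with every layout. In HTML filings the number may end up on a different line from the label once the tags are stripped, and a keyword that appears in running prose can pick up an unrelated number such as a year.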