Clean SEC EDGAR data · one-time download

Analysis-ready SEC data,
without the XBRL headache.

Normalized S&P 500 fundamentals, Form 4 insider trades, and the latest quarter of 13F holdings — as clean CSV & Parquet, with a data dictionary and a Jupyter notebook that actually runs. Point-in-time snapshots. Instant download.

3 packs · 448k+ rows total · Every row traces to its SEC filing · Instant delivery

The packs

Each pack ships with the data (CSV and/or Parquet), a full data dictionary, a runnable Jupyter notebook, a free 100-row sample, and a manifest listing SHA-256 checksums and the extract date. Extract date: 2026-07-05.
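The manifest checksums make it possible to confirm a download arrived intact. Here is a minimal sketch of that check. The manifest file name `manifest.json` and its layout (`{"files": {"<name>": "<sha256 hex>"}}`) are assumptions for illustration and may differ from what the pack actually ships; adjust the keys to match the real manifest.

```python
import hashlib
import json
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 of a file, read in chunks to keep memory flat."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_pack(pack_dir: Path, manifest_name: str = "manifest.json") -> dict[str, bool]:
    """Compare each listed file's checksum with its manifest entry.

    Assumes a hypothetical layout: {"files": {"<name>": "<sha256 hex>"}}.
    """
    manifest = json.loads((pack_dir / manifest_name).read_text())
    return {
        name: sha256_of(pack_dir / name) == expected.lower()
        for name, expected in manifest["files"].items()
    }
```

Calling `verify_pack(Path("path/to/pack"))` returns one `True`/`False` per file, so a single `all(...)` over the values tells you whether the whole pack matches.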

CSV 148,250 rows

S&P 500 Fundamentals

187 companies · income · balance sheet · cash flow

Normalized financials for S&P 500 constituents — revenue, margins, EPS, assets, debt, cash flow — one clean typed row per statement and period. Every figure carries the source filing URL.

  • Income, balance-sheet & cash-flow columns
  • Annual / quarterly / YTD periods
  • Traceable to each 10-K / 10-Q

Parquet 77,835 rows

Insider Trades (Form 4)

Every Form 4 transaction line

The full Form 4 insider-transaction table — who bought or sold, how many shares, at what price, and their relationship to the company. Ships as Parquet (+ gzip CSV).

  • Buy/sell codes, shares, price, value
  • Officer / director / 10%-owner roles
  • Filter to purchase (P) and sale (S) codes for real signal; the notebook shows how
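As a rough illustration of the P/S filter, the sketch below keeps only rows coded P (purchase) or S (sale) and nets the dollar value per ticker. It is not the notebook's code. The column names `ticker`, `transaction_code`, `shares`, and `price` are assumptions; check the data dictionary for the real ones. Rows are plain dicts, as `csv.DictReader` would yield from the gzip CSV.

```python
from collections import defaultdict

# Form 4 transaction codes: P = purchase, S = sale.
# Other codes (awards, option exercises, gifts, ...) are dropped.
TRADE_CODES = {"P", "S"}


def purchases_and_sales(rows):
    """Keep only rows whose transaction code is P or S."""
    return [r for r in rows if r["transaction_code"] in TRADE_CODES]


def net_value_by_ticker(rows):
    """Sum shares * price per ticker, counting sales as negative."""
    totals = defaultdict(float)
    for r in purchases_and_sales(rows):
        sign = 1.0 if r["transaction_code"] == "P" else -1.0
        totals[r["ticker"]] += sign * float(r["shares"]) * float(r["price"])
    return dict(totals)


# Made-up rows, for illustration only.
sample = [
    {"ticker": "AAA", "transaction_code": "P", "shares": "100", "price": "10.0"},
    {"ticker": "AAA", "transaction_code": "A", "shares": "5000", "price": "0"},
    {"ticker": "AAA", "transaction_code": "S", "shares": "40", "price": "12.5"},
    {"ticker": "BBB", "transaction_code": "S", "shares": "10", "price": "20.0"},
]
print(net_value_by_ticker(sample))  # {'AAA': 500.0, 'BBB': -200.0}
```

The award row (code A) is excluded before netting, which is the point of the filter: grants and exercises would otherwise swamp the discretionary trades.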

Parquet 222,012 rows

Latest-Quarter 13F Holdings

2026-Q1 · institution × position

The newest quarter of institutional 13F holdings — every manager, every position, value and share count, with voting authority. Ships as Parquet (+ gzip CSV).

  • Value, shares, CUSIP, class
  • Sole / shared voting authority
  • Top-holders & manager-book recipes in the notebook
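A top-holders recipe can be as small as a grouped sum. The sketch below is an illustration under assumed column names (`manager_name`, `cusip`, `value`), not the notebook's own code. It sums per manager because one manager may report the same security on more than one line.

```python
from collections import defaultdict


def top_holders(rows, cusip, n=5):
    """Rank managers by total reported value in one security (CUSIP)."""
    totals = defaultdict(float)
    for r in rows:
        if r["cusip"] == cusip:
            totals[r["manager_name"]] += float(r["value"])
    # Largest position first; keep the top n.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:n]


# Made-up rows with placeholder CUSIPs, for illustration only.
holdings = [
    {"manager_name": "Manager A", "cusip": "000000001", "value": "300"},
    {"manager_name": "Manager B", "cusip": "000000001", "value": "500"},
    {"manager_name": "Manager A", "cusip": "000000001", "value": "250"},
    {"manager_name": "Manager C", "cusip": "000000002", "value": "900"},
]
print(top_holders(holdings, "000000001", n=2))
# [('Manager A', 550.0), ('Manager B', 500.0)]
```

Swapping the filter and the grouping key (filter on `manager_name`, group by `cusip`) gives the manager-book view instead.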

Point-in-time snapshot: the extract date is on every file

Why we sell snapshots, not "live-ish" files

A static download can't stay fresh — so we don't pretend it does. Every pack is frozen as of a stamped date. That's the honest, reproducible way to buy filing data.

Reproducible

The numbers won't shift under your analysis. "As of 2026-07-05" is unambiguous and citable.

Traceable

Every row carries the SEC accession_number and filing_url. Nothing is invented — it reconciles to the public filing.
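One way to spot-check traceability is to confirm that each row's identifiers are well formed before following the links. The sketch below assumes that `accession_number` is stored in EDGAR's dashed form (10 digits, 2 digits, 6 digits) and that `filing_url` points at `https://www.sec.gov/`. Both are assumptions about the pack's formatting, so loosen the checks if the data dictionary says otherwise.

```python
import re

# EDGAR's dashed accession format: filer ID, two-digit year, sequence number.
ACCESSION_RE = re.compile(r"^\d{10}-\d{2}-\d{6}$")


def is_traceable(row):
    """True if the row has a well-formed accession number and a sec.gov URL."""
    accession = row.get("accession_number", "")
    url = row.get("filing_url", "")
    return bool(ACCESSION_RE.match(accession)) and url.startswith("https://www.sec.gov/")


def untraceable_rows(rows):
    """Return the rows that fail the check, for manual inspection."""
    return [r for r in rows if not is_traceable(r)]
```

An empty result from `untraceable_rows(...)` only shows the identifiers look right; reconciling a figure still means opening the filing at `filing_url`.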

Instant, keep it forever

Download on purchase. No account, no subscription, nothing to cancel, and no refunds to chase.

Need it continuously updated?

The packs are the same data, frozen and bulk-delivered. If you want it refreshed daily from SEC EDGAR via a JSON API, that's Filingrail.

Explore the Filingrail API →