Public NYC DOC Research Explorer

NYC Criminal Justice Explorer

Person-centric exploration of NYC DOC admissions, repeat incarceration, charge patterns, and the cleanest public arrest-to-jail bridge subset.

196,713 people447,391 jail episodes43.9% returned 2+ times32.5% within 1 year

How To Read This Explorer

What is exact and what is not

The site centers exact NYC DOC jail histories. Arrest context is narrower and appears only when the public data supports a candidate bridge.

  • Exact. DOC person pages, episode counts, stay lengths, gap lengths, and cohort return rates come from exact joins inside the public DOC feeds.
  • Candidate. Arrest rows and map points are candidate matches only. They come from a strict 1:1 bridge, not a full cross-system person key.
  • Unsupported. Court outcomes, prison histories, parole, and citywide multi-arrest identity resolution remain outside what public bulk data can support here.

Methods + Sources

What the app is built from

The current web app reads four derived Parquet outputs built from public NYC DOC and NYPD data. The broader repo also carries a larger public event panel for research work outside these routes.

Core runtime inputs are DOC admissions, DOC discharges, DOC cohort outcomes, and the candidate arrest-DOC bridge.

The broader repo also includes NYPD complaints, arrests, summonses, and Census geography enrichment for the multi-year public event panel.

The detailed build notes live on dedicated site pages so the methodology stays visible instead of buried in repo docs.

High-Signal Profiles

Top Recidivists

People with the most jail admissions. Click any row for the full timeline, gap structure, linked arrests, and person-level history. The person identity here is exact within DOC. Any linked arrests you see later are candidate bridge matches, not ground-truth cross-system IDs.

INMATEIDAdmissionsTierRaceSexBirth YearFirst AdmissionLast AdmissionAvg StayFirst Charge

Admissions by Year

1-Year Return Rate

Scope

What this site covers well

Exact DOC repeat-admission histories, timing between jail episodes, cohort return rates, and a narrow arrest-to-jail bridge subset where the public fields line up cleanly enough to keep only unique 1:1 matches.

Limits

What public data still cannot do

This explorer does not claim court outcomes, prison histories, parole, or true multi-arrest person resolution across the full NYC criminal-justice pipeline. Those require restricted identifiers that are not exposed in public bulk releases.