EPA ECHO
Facilities, violations, cases, emissions
ECHO → RBLC → LLM
EPA ECHO 벌크 데이터, RBLC control evidence, state event records, and planned small-domain LLMs turn air-permit assumptions into an auditable pre-permit risk memo.
Facilities, violations, cases, emissions
Controls, limits, applicability
Explain, cite, summarize, checklist
ECHO petrochemical/EG candidates
Temporal forward validation
Grouped by REGISTRY_ID split
Petrochemical permit screens
These are validation metrics for the research pipeline, not live permit approval probabilities.
EA means Enforcement Action. ROC-AUC measures how well the model ranks future enforcement-action cases above non-action cases when validation is done forward in time.
F1 balances precision and recall for extracted text labels. The split is grouped by REGISTRY_ID so records from the same facility do not leak between train and test sets.
Critical enforcement-action risk is driven by Texas petrochemical NAICS exposure, Title V/MACT/PSD/NSR program overlap, HAP/VOC pollutants, and flare/MSS terms. Check RBLC controls and TCEQ event narratives before permit assumptions are fixed.
Consolidates federally enforceable air requirements, monitoring, reporting, and deviation certification for major stationary sources.
Screens new projects and modifications before construction. Major projects can require BACT or LAER, modeling, offsets, and public notice.
Applies source-category standards for HAP emissions, including process controls, LDAR, monitoring, recordkeeping, and work-practice limits.
Sets technology-based requirements for listed new, modified, or reconstructed source categories such as engines, heaters, tanks, and process units.
EPA clearinghouse for BACT, RACT, and LAER determinations. It helps compare controls and limits used in similar permits.
ECHO aggregates facilities, violations, inspections, emissions, and Clean Air Act enforcement actions used for analog risk screening.
Tracks CO2e reporting and measurement, reporting, and verification assumptions. It should be handled separately from CAA criteria-pollutant risk.
TCEQ, LDEQ, and other state records can reveal startup, shutdown, maintenance, flaring, and permit-revision patterns that federal data misses.
The first model is ECHO-centered. The next research layer connects RBLC controls, EIASS bilingual alignment, GHGRP/K-ETS reporting, and permit revision sequences.