How Quinn cites NI 43-101, S-K 1300, PIPEDA, and 10+ other regulatory regimes
Quintarth's RAG (Retrieval-Augmented Generation) corpus is a curated collection of ~415 chunks of public-domain regulatory text. Each chunk is a faithful summary (not a verbatim copy) of a specific provision, with a citation back to the authoritative source.
| Cluster | Chunks | Coverage |
|---|---|---|
| Canadian mineral disclosure | ~80 | NI 43-101 + CIM Definition Standards 2014 + provincial rules |
| US mineral disclosure | ~30 | S-K 1300 + SEC mining rules |
| Canadian privacy | ~18 | PIPEDA (federal) + Quebec Law 25 / Bill 64 + Alberta PIPA + BC PIPA |
| Securities (CA) | ~75 | NI 45-106, NI 51-102, NI 52-109, NI 58-101, OSC/AMF/BCSC/ASC |
| Securities (US) | ~85 | SEC Reg S-K, S-X, Form 10-K, 10-Q, 8-K, Form 4, Reg D, Reg S, Reg A+, NMS, FD, SHO, ATS |
| Cross-border / MJDS | ~12 | Multi-Jurisdictional Disclosure System, foreign issuer rules |
| Compliance / sanctions | ~25 | FINTRAC, FCPA, OFAC, Volcker, SOX, Basel III |
| Mining geology | ~30 | Deposit types, mining methods, processing, JORC, SAMREC, CRIRSCO |
| Markets / instruments | ~60 | VWAP, ATS, options, ETF mechanics, SPAC, perp DEX, RWA, MEV |

Embedding model: nomic-embed-text (768-dim, English-primary, runs locally on our VPS).

The corpus is curated by hand. New chunks are added in corpus_v{N}.py scripts, and ingest is idempotent: deterministic UUIDs prevent duplicates. The current corpus is at v7 (2026-04-25), totaling 415 chunks across 13 jurisdictions.
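
To illustrate how deterministic UUIDs can make ingest idempotent, here is a minimal sketch. It is not the actual corpus_v{N}.py code: the `Chunk` fields, the `CORPUS_NAMESPACE` value, the choice of `cluster|citation` as the identity key, and the in-memory `store` dict are all assumptions made for illustration. The idea is only that a chunk's ID is derived from its content identity rather than generated randomly, so re-running a script writes to the same keys instead of creating duplicates.

```python
import uuid
from dataclasses import dataclass

# Hypothetical fixed namespace. Any constant UUID works, as long as it never changes.
CORPUS_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_URL, "https://example.com/rag-corpus")


@dataclass(frozen=True)
class Chunk:
    cluster: str   # e.g. a row label from the coverage table
    citation: str  # pointer back to the authoritative source
    summary: str   # faithful summary of the provision (not verbatim text)


def chunk_id(chunk: Chunk) -> uuid.UUID:
    """Derive a stable UUIDv5 from the chunk's identity (assumed: cluster + citation)."""
    return uuid.uuid5(CORPUS_NAMESPACE, f"{chunk.cluster}|{chunk.citation}")


def ingest(store: dict, chunks) -> int:
    """Upsert chunks keyed by deterministic ID and return how many were new."""
    added = 0
    for chunk in chunks:
        key = chunk_id(chunk)
        if key not in store:
            added += 1
        store[key] = chunk  # same key on every run, so no duplicates accumulate
    return added


store = {}
batch = [
    Chunk(
        cluster="Canadian mineral disclosure",
        citation="NI 43-101 (placeholder provision)",
        summary="Placeholder summary text.",
    )
]

first = ingest(store, batch)
second = ingest(store, batch)  # re-running the same batch adds nothing
print(first, second, len(store))  # 1 0 1
```

Because the key in this sketch excludes the summary text, editing a summary and re-ingesting would overwrite the existing entry rather than add a second one. Whether the real scripts key on the citation, the text, or something else is not stated here.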