Skip to main content

Ten Crucibles · classification to graph · 184 countries · 2,687 topics

Human knowledge, walked from classification system to knowledge graph.

Ten hand-authored sections cover the epistemology / how-knowledge-is-organised layer that research and citation discipline depend on: classification systems · encyclopedias & reference · open knowledge · academic publishing · scholarly databases · standards bodies · patents · archives & records · digital preservation · knowledge graphs. Dewey-LCC-UDC-MeSH-ICD-HS taxonomies, Britannica-Larousse-Brockhaus and SEP signed-authority pattern, Wikipedia 6.8M plus Wikidata 110M plus OSM plus IA, Plan S and Nature-Cell-Lancet peer review, Scopus-WoS-PubMed-Scholar, ISO-IEEE-IETF-W3C, WIPO-PCT and FRAND-SEPs, NARA-TNA-NAI plus FamilySearch, OAIS-LOCKSS-Portico, Wikidata-DBpedia-Schema.org-LOD. No filler — every Crucible cites real institutions, real frameworks, real numbers.

Classification systems

Dewey, LCC, UDC, BISAC, MeSH, ICD, NAICS, HS — the taxonomies that organise human knowledge.

Classification systems are the underlying ontologies of every library, every database, every regulatory regime, every trade flow. The platform documents the major systems. Library and bibliographic: Dewey Decimal Classification (DDC) is used by the majority of public libraries globally with 10 main classes, 100 divisions, 1000 sections; 23rd edition active. Library of Congress Classification (LCC) dominates US academic libraries with 21 main classes A-Z; expandable to many tens of thousands of sub-classes. Universal Decimal Classification (UDC) — multilingual, used widely in European and Russian libraries. Colon Classification — S.R. Ranganathan's faceted-classification system, influential particularly in Indian library science. Subject heading systems: LCSH (Library of Congress Subject Headings), MeSH (Medical Subject Headings, the NLM controlled vocabulary used in PubMed indexing), MeSH-EU and other linguistic adaptations. Publishing-trade: BISAC subject codes (US trade), Thema (international trade adopted post-2014). Medical and clinical: ICD-10 / ICD-11 (WHO disease classification), SNOMED CT (clinical terminology), LOINC (lab tests), CPT (procedures, US AMA).

Economic and trade taxonomies: HS (Harmonised System) is the World Customs Organisation 6-digit international product nomenclature with national 8-10 digit extensions — covers virtually all merchandise trade globally; the WCO HS 2022 edition is the current revision with HS 2027 in development. NAICS (North American Industry Classification System) replaced the old SIC codes for US/Canada/Mexico statistics. NACE (EU equivalent), ISIC (UN Industrial Classification), ANZSIC (Australia/NZ), NIC (Indian National Industrial Classification). The platform documents cross-walks between HS, NAICS, ISIC, NACE for trade-and-industry analysis.

Geographic and population taxonomies: ISO 3166 (country codes alpha-2, alpha-3, numeric — the foundation of nearly all cross-border data systems), ISO 639 (language codes), ISO 4217 (currency codes), UN M.49 (region codes), NUTS (EU regional classification), FIPS (US federal), UN/LOCODE (location codes for ports / inland points), GeoNames (the open-data populated-place database with around 12M place names), OSM features (the OpenStreetMap typology). Demographic taxonomies: ICD-related cause-of-death classification for vital registration; ISCO (International Standard Classification of Occupations) for labour-market data; ISCED (International Standard Classification of Education) for cross-border education comparison.

Encyclopedias & reference

Britannica, Larousse, Brockhaus, SEP, Stanford Encyclopedia of Philosophy — curated authority references.

Encyclopedic reference works represent the most-curated authority layer in any knowledge ecosystem, with editorial governance, peer review, and explicit citation discipline distinguishing them from open-collaborative alternatives. The platform documents the major traditions. Encyclopædia Britannica — founded Edinburgh 1768; the print 32-volume 15th edition was the last published in 2010; now digital-only with rolling editorial updates. Larousse (French; published from 1856; the Petit Larousse Illustré remains a flagship single-volume annual). Brockhaus (German; founded 1796; print discontinued 2013, digital licensing continues). Encyclopaedia Iranica (Persian-civilisation reference; Columbia University-anchored). Banglapedia, Encyclopaedia of Islam (Brill, Leiden), New Catholic Encyclopedia, Jewish Encyclopedia / Encyclopaedia Judaica, Encyclopedia of Indian Philosophies (Karl Potter), Bharatkosh (Hindi general reference). Subject-specific scholarly encyclopedias: SEP (Stanford Encyclopedia of Philosophy — the gold-standard scholarly reference for academic philosophy), MIT Encyclopedia of the Cognitive Sciences, Encyclopedia of Mathematics (Springer), Grove Music Online (Oxford), Oxford Dictionary of National Biography.

The encyclopedic discipline distinguishes itself from open-content via signed expert authorship (the SEP entry on a topic is written by an acknowledged specialist), explicit editorial review, citation rigor, and version stability with archived prior versions. SEP's technology is particularly notable: every entry is dated, has an explicit table of contents, footnotes are full bibliographic, and the entry is archived on every substantive update. Comparison with Wikipedia: SEP has approximately 1,800 entries (curated philosophy); Britannica has approximately 120,000 entries; Wikipedia has around 6.8M English articles — the trade-off is breadth-versus-curation, with the platform's recommendation that policy-grade or research-grade citation should generally use signed-and-dated encyclopedic sources rather than collaborative-editing platforms.

Specialist reference works: medical reference (Harrison's Principles of Internal Medicine, Goodman & Gilman's Pharmacological Basis, Williams Hematology, Robbins Pathology — the serial-edition standards), legal (Halsbury's Laws of England, Restatement of the Law US, Cornell Legal Information Institute, India Code, Singapore AGC database), economic / financial (Palgrave Macmillan New Palgrave Dictionary of Economics, IMF Glossary of Selected Financial Terms, BIS / FSB / IOSCO documents), statistical (UN Statistics Division glossaries, OECD Statistical Methods Compendium, ESS standards). Translation reference: bilingual / multilingual specialist dictionaries (the EU IATE, the FAO terminology portal, ISO terminology standards). The platform's reference-discipline emphasises locating an authority source rather than relying on aggregated content.

Open knowledge

Wikipedia, Wikidata, OpenStreetMap, Project Gutenberg, Internet Archive — the global commons.

Open knowledge projects have collectively built the largest accessible reference layer in human history, with permissive licensing (CC BY-SA, CC0, public domain) enabling reuse and machine-readable formats enabling integration. The platform documents the major projects. Wikipedia (Wikimedia Foundation; founded 2001): 320+ language editions, around 6.8M articles in English alone, around 60M articles total across all languages; Vital Articles classification, GA / FA quality grading, citation policy. Wikidata (founded 2012): over 110M items with structured statements; the canonical machine-readable backbone now feeds Wikipedia infoboxes and major external knowledge-graphs. Wikimedia Commons: over 110M media files (images, audio, video) under free licences. Wiktionary: dictionary in 100+ languages. Wikisource: source-document library. Wikispecies, Wikiquote, Wikibooks, Wikinews, Wikimedia Incubator for new languages.

OpenStreetMap (OSM) is the open-content global geographic database, founded 2004; over 10M registered contributors; daily diff-and-history publication; the foundation underneath Apple Maps (extensive OSM use post-2018), Facebook Maps, Snap Map, Foursquare, MapBox, MapTiler, and many open-source geocoders. The OSM data model uses nodes / ways / relations with arbitrary tagging governed by community consensus rather than fixed schema. Coverage quality ranges from comprehensive (Western Europe, North America, Japan, India urban) to sparse (much of rural Africa, parts of Central Asia) — with HOT (Humanitarian OpenStreetMap Team) coordinated mapping in disaster-response and underserved-area projects.

Open-content libraries: Project Gutenberg (public-domain ebooks, founded 1971, around 70K titles), Internet Archive (founded 1996; the Wayback Machine archives over 800 billion web pages, plus IA Books / Audio / TV / Software / Open Library / Scholar; the controversial Controlled Digital Lending lawsuit by publisher consortium 2023 was lost by IA on initial ruling, with appeal continuing), HathiTrust (academic-library digitisation consortium), Europeana (EU cultural-heritage aggregator), DPLA (Digital Public Library of America), Trove (National Library of Australia), National Digital Library of India (NDLI). OpenLibrary, WorldCat (the OCLC catalogue with 500M+ records), BHL (Biodiversity Heritage Library) round out the major open-knowledge layer. The platform documents licensing and attribution requirements for reusing each source.

Academic publishing

Peer review, preprints, open access, impact factors — the scholarly-output ecosystem.

Academic publishing remains the institution that certifies scholarly contribution, with peer review as the central quality-assurance mechanism and citation as the recognition currency. The platform documents the major dimensions. The major commercial publishers: Elsevier (RELX) operates around 2,800 journals including The Lancet and Cell; Springer Nature operates the Nature family plus Springer + Palgrave + BMC; Wiley operates around 1,600 journals; Taylor & Francis around 2,700; SAGE around 1,000. The leading individual journals by reputation: Nature, Science, Cell, Lancet, NEJM, JAMA, BMJ, PNAS, Physical Review Letters; in field-specific terms there are several thousand journals carrying real influence. Society publishers: ACS (American Chemical Society), AIP (American Institute of Physics), IOP (UK Institute of Physics), Royal Society, AMS (Mathematics), ASCE / ASME / AIChE engineering, MLA (Modern Language Association). University presses: Oxford University Press, Cambridge University Press, Princeton, Harvard, Chicago, MIT Press, Yale, Stanford, Indian university presses (OUP-India, CUP-India).

Preprint servers have transformed dissemination speed in many fields. arXiv (founded 1991 at LANL, now Cornell-hosted): the foundational preprint server for physics / mathematics / computer science / quantitative biology / quantitative finance / statistics; over 2M papers; daily new postings drive the conversation in many sub-fields. bioRxiv (Cold Spring Harbor; biology), medRxiv (medicine; the COVID-era surge was disciplinary turning-point), SSRN (Elsevier-owned; social sciences and humanities), Research Square, ChemRxiv, EarthArXiv, PsyArXiv. The platform documents preprint-citation conventions: emerging consensus toward citing preprint plus DOI plus version, and toward formal preprint-to-publication chain-of-record disclosure.

Open Access (OA) and Plan S: the mandate originating with cOAlition S funders (initially Wellcome, ERC, FNR, etc.) requires Plan S compliance for funded research output; transformative agreements (read-and-publish deals) between universities and commercial publishers have substantially shifted Europe's journal-economics. OA models: gold (publisher-side OA with article-processing-charges), green (author-self-archived in repository), diamond (no-fee OA, often via society or scholar-led journals like the SciELO network in Latin America). Predatory journals: Beall's List historical legacy, Cabells Predatory Reports current commercial offering, DOAJ inclusion-criteria tightening, plus retraction-tracking via Retraction Watch; the platform's academic-source filtering criteria are explicit in /methodology/.

Scholarly databases

Scopus, Web of Science, PubMed, Scholar, JSTOR — the search layer over scholarship.

Scholarly databases are the indexing-and-discovery layer over the academic-publishing output. The platform documents the major systems. Multidisciplinary citation databases: Scopus (Elsevier; covers around 27,000 journals; provides citation-counts, h-index, SJR-Scimago Journal Rank), Web of Science (Clarivate; selective coverage of around 21,000 journals; provides JIF Journal Impact Factor in JCR Journal Citation Reports), Google Scholar (free; broadest coverage including grey-literature, theses, working-papers; less-curated quality assurance; Publish-or-Perish desktop app extracts citation metrics), Dimensions (Digital Science; alternative comprehensive index), OpenAlex (open-data successor to Microsoft Academic Graph; 240M+ works). Subject-specific: PubMed (NLM; biomedical — the dominant medical-literature search), Embase (Elsevier; biomedical with stronger pharmacology coverage), CINAHL (nursing / allied health), PsycINFO (APA; psychology), ERIC (education).

Discipline-specific databases: IEEE Xplore (electrical / electronics / computing), ACM Digital Library (computing), SciFinder / CAS (chemistry / pharmaceutical patents), MathSciNet (American Mathematical Society), zbMATH (European Mathematical Society), INSPIRE-HEP (high-energy physics), NASA ADS (astrophysics), SPIE Digital Library (optics / photonics), EconLit (American Economic Association; economics indexing), RePEc (open-economics; the Federal Reserve Bank of Saint Louis FRED-related repository), SSRN (overlapping preprint-and-database role for social science), JSTOR (Andrew W. Mellon Foundation; arts / humanities backfile).

Citation metrics and bibliometrics: JIF (Journal Impact Factor) — the most-cited but most-criticised metric; 2-year window of citations / publications; field-comparison-distorted; field-normalised alternatives (FWCI, SNIP) emerged in response. h-index (Hirsch 2005; balances productivity and impact) is the dominant per-author metric. SCImago SJR, SNIP (CWTS), Eigenfactor (PageRank-style for journals), Altmetric (social-media + policy + news mentions). DORA (Declaration on Research Assessment): signed by major funders / institutions / publishers; pushes back against single-metric (JIF) evaluation; pushes for content-evaluation and broader contribution-recognition. The platform's scholarship-citation discipline aligns to DORA principles.

Standards bodies

ISO, IEEE, IETF, W3C, Codex Alimentarius, ITU — the technical-standard infrastructure.

Standards bodies define the technical interfaces that allow global commerce, communication, and safety to function. The platform documents the major organisations. ISO (International Organisation for Standardisation): founded 1947, Geneva-headquartered; over 24,000 published international standards; member-bodies are national-standards bodies (BIS-India, ANSI-US, BSI-UK, DIN-Germany, AFNOR-France, JISC-Japan, SAC-China). IEC (International Electrotechnical Commission): electrical and electronic standards. ITU (International Telecommunication Union): UN specialised agency; ITU-T (telecom standardisation), ITU-R (radio-communication), ITU-D (development); the WTSA / WRC / WTDC quadrennial conferences. IEEE (Institute of Electrical and Electronics Electronics): over 1,100 published standards; IEEE 802 family (Ethernet, Wi-Fi, Bluetooth, Zigbee), IEEE 754 floating-point, IEEE 1800 SystemVerilog, etc. IETF (Internet Engineering Task Force): maintains internet-protocol standards via RFC documents; over 9,000 RFCs published.

Web and software standards: W3C (World Wide Web Consortium) — HTML, CSS, SVG, XML, RDF, OWL, accessibility (WCAG), the entire web-platform. WHATWG (parallel HTML/DOM Living Standards process). Ecma International (ECMAScript / JavaScript via ECMA-262, JSON via ECMA-404). Unicode Consortium (Unicode Standard, Unicode 16 current; CLDR locale data). Khronos Group (OpenGL, Vulkan, OpenCL, glTF). Bluetooth SIG, Wi-Fi Alliance, USB-IF, Trusted Computing Group, 3GPP (cellular generations 4G LTE, 5G, 6G research) are the major industry-consortium standards bodies. The platform indexes standard-track and spec-stage by major technology area.

Sector-specific standards: Codex Alimentarius (FAO/WHO joint — food safety standards globally), OIE / WOAH (animal health), IPPC (plant health), ICAO Annexes (aviation), IMO conventions (maritime — SOLAS, MARPOL, STCW), OIML (legal metrology), BIPM / CGPM / SI (units of measurement), ICH (International Council for Harmonisation — pharmaceutical regulation). Industry-association standards: API (American Petroleum Institute), ASME (mechanical engineering), ASTM International (materials testing), AISC (steel construction), ACI (concrete). The platform documents the WTO TBT Agreement reference: international standards take precedence as legitimate-basis for regulatory measures, making standards-body output economically consequential.

Patents & technical disclosure

WIPO PCT, EPO, USPTO, JPO, CNIPA — the patent system as global knowledge layer.

The patent system is, beyond its commercial-IP function, a vast technical-disclosure layer with mandatory-disclosure-in-exchange-for-monopoly trade-off. The platform documents the major systems. WIPO PCT (Patent Cooperation Treaty): enables international patent application via single procedure with national-phase entry across PCT member states (over 150 contracting states); around 280,000 PCT international applications filed annually. Major national / regional patent offices: EPO (European Patent Office): grants European Patents valid in EPC contracting states (around 40 states; UPC / Unitary Patent regime since 2023 covers around 17 EU states with single-effect protection), USPTO (US Patent and Trademark Office): largest individual office by total grants, with utility / design / plant patent classes, JPO (Japan Patent Office), KIPO (Korean Intellectual Property Office), CNIPA (China National Intellectual Property Administration) — now the largest by application volume, Indian Patent Office (Section 3(d) generics-friendly carve-out post-2005 product-patent regime).

Prior-art search and examination: USPTO PAIR, EPO Espacenet (free; over 140M global patent documents), WIPO PATENTSCOPE, Google Patents (full-text search across major offices), Lens.org (open-data patent intelligence), Derwent World Patents Index (Clarivate commercial), Orbit Intelligence (Questel commercial). Patent classification: IPC (International Patent Classification, WIPO), CPC (Cooperative Patent Classification, USPTO+EPO joint), Japanese FI / F-term, Locarno (designs). Patent families: INPADOC, DOCDB, simple family vs extended family methodology — relevant for understanding global filing strategies.

Specific regimes and practical issues: SEPs (Standard-Essential Patents): FRAND (fair, reasonable, non-discriminatory) licensing obligations for SEPs declared to standards bodies (3GPP, ETSI, IEEE) — the central commercial-IP friction in mobile, video-codec (H.264, HEVC, VVC, AV1 royalty-free counter-stack), Wi-Fi, Bluetooth standards. Patent thickets and pools: MPEG LA, Sisvel, Avanci, HEVC Advance the major commercial pools. Compulsory licensing under TRIPS Article 31 (active in HIV/AIDS antiretrovirals historically; pandemic-period proposals; Indian and Brazilian compulsory-licensing precedents). Defensive publication: Research Disclosure, IP.com, Google Cloud open-source patent commitments — alternatives to patenting that establish prior-art without seeking exclusive rights.

Archives & records

NARA, TNA, NAI, vital records, genealogy — the historical-record infrastructure.

Archives and records-management infrastructure preserves the documentary basis of historical, legal, regulatory, and family memory. The platform documents the major institutions. National archives: US NARA (National Archives and Records Administration) — over 13B pages plus electronic records; College Park MD plus regional sites; Presidential libraries integrated. UK TNA (The National Archives) — Kew; Domesday Book through current government records. Indian NAI (National Archives of India) — Delhi plus regional records centres; ASI archaeological records separately maintained. French Archives nationales, German Bundesarchiv, Russian GARF / RGADA / RGIA federal archives, Chinese Number One and Number Two Historical Archives, Australian NAA, Canadian LAC (Library and Archives Canada), Japanese NDL (National Diet Library) plus JACAR online, Brazilian Arquivo Nacional, Mexican AGN, Argentinian AGN. UN Archives and Records Management Section, League of Nations Archives (Geneva). ICRC archives (Geneva).

Vital records and civil registration: birth / marriage / death registration regimes vary widely. Strong-coverage economies (most of Europe, Anglo-settler economies, Japan, Korea, much of Latin America) have effectively-universal vital registration with multi-decade backfile. Coverage gaps: many lower-income economies still have substantial under-registration; CRVS (Civil Registration and Vital Statistics) programmes by WHO / UNICEF / UNFPA / World Bank coordinate strengthening. Genealogical research: FamilySearch (LDS Church-operated; the world's largest free genealogy resource with 14B+ records and growing microfilm-digitisation), Ancestry.com (commercial; AncestryDNA leading consumer-genetic genealogy), FindMyPast (UK / Irish / Australian focus), MyHeritage, 23andMe, Geni. Specialist archives: shoah / holocaust archives (USHMM, Yad Vashem, Wiener Holocaust Library), South African TRC archive, Khmer Rouge documents (DC-Cam, Tuol Sleng), Stasi files (BStU then federal-archive).

Records-retention regimes for businesses and individuals: tax records: typically 7+ years US (with up to 10 for substantial-omission cases), 6 years UK (with longer for VAT and corporation tax under specific circumstances), 8 years India (Section 143(2) and reassessment provisions), 7 years EU GDPR generally permits but specific sectoral rules vary. HR records: 7 years US (federal minimum; state may extend), 6 years UK. Medical records: 25-30 years typical for adult care; longer for pediatrics. Building records: lifetime-of-building plus 12 years typical UK. Corporate records: indefinite for incorporation / share-issue documents, 6+ years for board minutes most jurisdictions. The platform's records-retention table indexes by record-type, jurisdiction, and statutory basis.

Digital preservation

Internet Archive, LOCKSS, Portico, format migration — fighting bit-rot at scale.

Digital preservation is the long-term insurance layer for digital knowledge, with format-obsolescence, hardware-decay, and institutional-collapse all posing structural risks. The platform documents the major frameworks. OAIS (Open Archival Information System): ISO 14721 reference model defining functional entities (Ingest, Archival Storage, Data Management, Preservation Planning, Access, Administration); the conceptual foundation for nearly all serious digital-preservation programmes. Major preservation services: LOCKSS (Lots of Copies Keep Stuff Safe), Stanford-led; the original distributed-preservation network for journal content. CLOCKSS (Controlled LOCKSS): dark archive with trigger-event publication. Portico: Ithaka-operated; subscription-supported journal preservation. National-library preservation: BL (UK), BNF (France), DNB (Germany), Library of Congress (US), NDL (Japan), NLI (India), NLA (Australia), KB (Netherlands).

Web archiving: Internet Archive Wayback Machine archives over 800B web captures; Common Crawl open-data web crawl (around 250B pages); Archive-It (IA service for institutional collections). National web archives: UK Web Archive (BL), French BNF web-legal-deposit, Library of Congress web archive, Australian PANDORA / AWA, Indian NDLI components. Digital legal deposit: many jurisdictions have extended legal-deposit obligations from print to digital materials post-2010. Personal-archiving tools: ArchiveBox (open-source), SingleFile, Pinboard, Memex, Web Archive, archive.today (commercial alternative to IA). The platform's recommendation: archive every external citation in serious knowledge work to prevent link-rot — substantial empirical-research evidence shows 30-50% link-rot within 5 years for general web content.

Format and metadata standards: METS (Metadata Encoding and Transmission Standard), PREMIS (Preservation Metadata Implementation Strategies) — the core preservation-metadata frameworks. File-format obsolescence: open and well-documented formats (PDF/A, plain text, TIFF baseline, FLAC, WARC) preferred for preservation; proprietary single-vendor formats (older AutoCAD, older Pro Tools, older Microsoft Office binary) are migration risks. Format-migration tools (UDFR, PRONOM signature library, JHOVE / FITS validators); emulation as alternative-or-complement to migration (the Internet Archive emulator collections demonstrate the principle for software / games preservation). Bit-rot: hardware media half-life concerns drive periodic refresh-and-checksumming (BagIt manifest, fixity-checks); the platform's digital-preservation guides cover practical workflows for individual / small-organisation / large-institution scales.

Knowledge graphs & semantic web

Wikidata, DBpedia, Schema.org, Linked Open Data — machine-readable knowledge.

Knowledge graphs make machine-readable, queryable knowledge possible at scale, with applications across search, AI training, scholarly discovery, business intelligence, and ontology engineering. The platform documents the major resources. Wikidata: the canonical open-knowledge graph; over 110M items with structured statements, multilingual labels, references; SPARQL endpoint at query.wikidata.org. DBpedia: Wikipedia-extracted RDF graph; long-running parallel to Wikidata; multilingual chapters. YAGO: combines Wikipedia + WordNet + GeoNames; Max Planck Institute-hosted. ConceptNet: commonsense knowledge graph (MIT-led origin). Google Knowledge Graph: proprietary; powers Google Search infoboxes; estimated 5B+ entities. Bing Satori / Microsoft Knowledge Graph, Apple Spotlight knowledge, Amazon Product Graph, Meta entity graph, X / Twitter knowledge, LinkedIn Economic Graph are major-platform proprietary graphs.

Semantic Web stack and standards: RDF (Resource Description Framework) — W3C standard; the data model. RDF Schema, OWL (Web Ontology Language) — ontology layers. SPARQL — query language. SHACL — shape constraints. JSON-LD — the JSON-encoded RDF flavour increasingly used for embedded structured-data on the web (Google adopting JSON-LD as its preferred Schema.org delivery mechanism; Schema.org markup is now near-universal in major-publisher SEO). SKOS — Simple Knowledge Organisation System for thesauri / taxonomies. Linked Open Data (LOD) cloud: visualisation of cross-linked datasets — from a few dozen datasets at LOD inception (2007) to over 1,300 datasets currently catalogued.

Major ontologies and vocabularies: Schema.org (Google + Microsoft + Yahoo + Yandex; over 800 types and 1,400 properties; the de-facto SEO-relevant entity vocabulary). FOAF (Friend-of-a-Friend): people / social. SIOC: online communities. Dublin Core: metadata baseline. BIBFRAME: bibliographic (Library of Congress; eventual MARC successor). FIBO (Financial Industry Business Ontology). Gene Ontology (GO): biology. SNOMED CT, LOINC, RxNorm: clinical. ICD-O: oncology. NASA SWEET: earth science. QUDT: units / measurement. PROV: provenance. schema.org / DataCommons / OpenAlex / OpenStreetMap are the open-data backbone now feeding LLM training corpora — making knowledge-graph hygiene a frontier-AI input variable.

Knowledge domains matrix — 20 fields

Discipline, depth score, recommended starting resource — the cross-discipline knowledge-mapping dataset.

DomainCategoryDepthStarting resourceNote
World History Humanities 96 Hartog Cassell · Stearns world history Vast field. Western/Eastern/Global frame matters.
Philosophy Humanities 92 Bertrand Russell · Stanford SEP Western: Plato → Kant → analytic. Eastern: Confucian, Buddhist, Vedic.
Economics Social-science 92 Mankiw Macroeconomics · Krugman Micro/macro distinction. Behavioural challenges classical models.
Physics STEM 96 Halliday-Resnick · Feynman lectures Classical → quantum → relativity arc. Math-prerequisite intensive.
Biology STEM 88 Campbell Biology Cell + genetics + evolution + ecology pillars.
Chemistry STEM 85 Atkins Chemistry · OpenStax Inorganic + organic + physical + biochemistry branches.
Mathematics STEM 92 Stewart Calculus · Strang Linear Algebra Calculus → linear algebra → statistics → discrete math arc.
Literature Humanities 88 Norton Anthology · Cambridge Companions World-literature canon expanding beyond Western. Genre vs period frames.
Art History Humanities 78 Janson · Gombrich · Khan Academy Western canon dominant; non-Western fields growing rapidly.
Music Theory Arts 72 Tonal Harmony · Open Music Theory Western tonal foundation; modal + non-Western increasing emphasis.
Psychology Social-science 82 Myers Psychology · Coursera Cognitive + clinical + developmental + social branches.
Sociology Social-science 72 Giddens Sociology Theory → methods → contemporary issues. Marx/Weber/Durkheim foundation.
Political Science Social-science 78 Heywood Politics Comparative + international + theory + methods branches.
Comparative Religion Humanities 78 Smith World's Religions Abrahamic + Dharmic + indigenous + secular framings.
Geography Social-science 72 Atlas + GIS · National Geographic Physical + human + GIS branches. Climate-change increasing relevance.
Anthropology Social-science 72 Kottak Anthropology Cultural + biological + archaeological + linguistic branches.
Linguistics Humanities 72 Yule Study of Language Phonology · syntax · semantics · pragmatics · historical · sociolinguistics.
Engineering STEM 92 Various: Felder · Hibbeler · Ogata Civil · mechanical · electrical · chemical · computer-engineering specialties.
Medicine STEM 96 Robbins Pathology · Harrison's Pre-med → MD/medical school → residency → fellowship pyramid.
Law Social-science 85 Country-specific texts Common-law (US/UK) vs civil-law systems. Specialisation branches.

Source: Stanford Encyclopedia of Philosophy · Britannica · Khan Academy · academic textbook canon. 20 domains.

Knowledge-ranked listicle index — 5 themes

Hardest fields to master, fastest-to-revenue, best self-study, multidisciplinary combos, oldest disciplines still relevant.

Curated cross-cuts.

Hardest fields to master

Time-to-competence + cognitive load

Top: Medicine · Theoretical physics · Pure math

Fastest fields to revenue

Time from learning to first paid work

Top: Web dev · Digital marketing · Sales

Best fields for self-study (no degree)

Strong self-taught success path

Top: Programming · Writing · Sales

Best multidisciplinary combinations

Cross-domain career multiplier

Top: CS+economics · Bio+CS (bioinfo) · Law+tech

Oldest disciplines still highly relevant

Ancient → modern continuity

Top: Philosophy · Mathematics · Astronomy

5 listicles in v206.4 ship.

PDF reference shelf — knowledge

Encyclopedic + reference authoritative sources.

Knowledge corpus catalogued. · 4 top sources surfaced.

Stanford Encyclopedia of Philosophy
Stanford University · 2026 · knowledge
Encyclopedia Britannica Online 2026
Britannica · 2026 · knowledge
Khan Academy Curriculum Database 2025
Khan Academy · 2025 · knowledge
Wikipedia Citation Quality Report 2025
Wikimedia · 2025 · knowledge