SCB Import

Bulk ingestion of SCB (Statistiska centralbyrån) foundation/organisation registry.

File: src/import/scb-import.ts.

Source: docs/SYSTEM_OVERVIEW.md § Data imports → SCB.

Run

bun run src/import/scb-import.ts data/scb_bulkfil_JE_20260302T060013_63.txt

Format

  • TSV (tab-separated)
  • Streaming parser

Volume

  • 646,127 rows in scb_foundations (current DB)

Key columns

  • Reklam (column index 20) → advertising_block boolean
    • Value "2" means the company has opted out of marketing → see Reklamspärr
  • Ng1Ng5sni_codes (SNI = Swedish industry classification)
  • Ng1sni_primary

Target schema

CREATE TABLE scb_foundations (
  org_nr            TEXT UNIQUE NOT NULL,
  name              TEXT,
  sni_codes         TEXT[],      -- Ng1–Ng5
  sni_primary       TEXT,        -- Ng1
  advertising_block BOOLEAN,     -- true = SCB Reklam=2
  entity_status     TEXT
);

See Database Schema.

Compliance impact

advertising_block = true → must skip enrichment. Enforced by isScbAdvertisingBlocked() in EnrichV7 step 1, but NOT by the queue worker (P0 in Known Issues).

See also

Bolagsverket Import, Reklamspärr, Database Schema, Known Issues.