Dream Commons Aggregates: Monthly, Open Data for Dream Research

Blog Image

Dream Commons Aggregates

Today we’re launching *Dream Commons Aggregates, a monthly, de-identified dataset derived from The Dream Drop. Each release is citable, versioned, and designed for reproducible research, teaching, and exploratory analysis.

Concept DOI (always latest)
September 2025 release (version DOI)


What’s in each monthly release

  • High-level counts (sample size, engagement).
  • Per-feature distributions across domains (e.g., Type, Mood, Theme, Characters, Perspective, Time, Impact, Lucidity, Recurring).
  • Month-over-month deltas in percentage points.
  • Stopword-filtered term frequencies (CJK-aware).
  • Documentation: SCHEMA.md, QUALITY_REPORT.csv, MANIFEST.json, CHANGES.md, CITATION.txt, LICENSE.txt, DATA_USE.txt.
  • Convenience: JSON/Parquet mirrors, a replication notebook, and a plain-text pulse summary of the largest positive/negative movements.

Everything is UTF-8, with checksums for core artifacts to support integrity verification and reproducible workflows.


Why we’re doing this

This series puts our Collective Dream Model (CDM) into practice: individual logs in The Dream Drop contribute to a growing, open knowledge base. By releasing standardized monthly aggregates, we:

  • lower the barrier to independent validation and secondary analysis,
  • support teaching and method development with clean, versioned data,
  • preserve a public record of change over time.

How to use & cite

  • License: CC BY-NC 4.0 (non-commercial).
  • Attribution: “Root Code Collective — Dream Commons (monthly aggregate release)”.
  • Citation: Use the Version DOI for the month you analyze. For general pointers, see our Citation Guidelines.

Quick links:


Privacy & ethics

Releases contain only de-identified aggregates, no personal, raw, or row-level data. Please:

  • avoid granular breakdowns that could expose small groups,
  • refrain from any re-identification attempts,
  • report results with appropriate caveats.

See DATA_USE.txt for the full policy.


The Monthly Pulse

Each release includes a short pulse summary, the largest positive and negative movements by domain (vs. the previous month). Here’s the format readers will see:

Type +4.9% standard -3.9% positive

Mood +3.1% defeated -3.2% surprise

… (full summary in pulse_movements_YYYY-MM.txt)


For researchers & educators

  • Reproducibility: A notebook (notebooks/replicate_pulse_YYYY-MM.ipynb) demonstrates quick checks, visuals, and pulse extraction.
  • Interoperability: CSV + JSON + Parquet, with a clear schema and codebook.
  • Provenance: Build hashes and checksums are included for auditability.

If you need a different format or a small convenience export for teaching, email us: contact@rootcodecollective.org.


Roadmap

  • Light, human-readable methods preprint describing the pipeline end-to-end.
  • Additional derived tables (e.g., longer-horizon deltas) as the series matures.
  • Community examples and teaching notebooks.

Get the data

Thank you for supporting ethical, open-access dream research. If you build something with the data, paper, visualization, teaching module, tell us! We’d love to feature it.