Dream Commons Aggregates
Today we’re launching *Dream Commons Aggregates*, a monthly, de-identified dataset derived from The Dream Drop. Each release is citable, versioned, and designed for reproducible research, teaching, and exploratory analysis.
What’s in each monthly release
- High-level counts (sample size, engagement).
- Per-feature distributions across domains (e.g., Type, Mood, Theme, Characters, Perspective, Time, Impact, Lucidity, Recurring).
- Month-over-month deltas in percentage points.
- Stopword-filtered term frequencies (CJK-aware).
- Documentation: SCHEMA.md, QUALITY_REPORT.csv, MANIFEST.json, CHANGES.md, CITATION.txt, LICENSE.txt, DATA_USE.txt.
- Convenience: JSON/Parquet mirrors, a replication notebook, and a plain-text pulse summary of the largest positive/negative movements.
Everything is UTF-8, with checksums for core artifacts to support integrity verification and reproducible workflows.
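As an illustration of the integrity check, here is a minimal Python sketch that recomputes file digests and compares them with a manifest. It rests on assumptions that are not confirmed by the release itself: that the checksums are SHA-256, and that MANIFEST.json holds a "files" list of entries with "path" and "sha256" keys. The function names are hypothetical; check the real MANIFEST.json and SCHEMA.md and adjust accordingly.

```python
import hashlib
import json
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_release(release_dir: Path, manifest_name: str = "MANIFEST.json") -> dict:
    """Map each file listed in the manifest to "ok", "mismatch", or "missing".

    ASSUMPTION: the manifest looks like
        {"files": [{"path": "some_file.csv", "sha256": "<hex digest>"}, ...]}
    The real layout may differ; adapt the keys below to the actual file.
    """
    manifest_path = release_dir / manifest_name
    manifest = json.loads(manifest_path.read_text(encoding="utf-8"))
    results = {}
    for entry in manifest["files"]:
        target = release_dir / entry["path"]
        if not target.is_file():
            results[entry["path"]] = "missing"
        elif sha256_of(target) == entry["sha256"].lower():
            results[entry["path"]] = "ok"
        else:
            results[entry["path"]] = "mismatch"
    return results
```

Calling `verify_release(Path("path/to/unpacked/release"))` on a downloaded release would then return one status per listed file; anything other than "ok" is worth investigating before analysis.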
Why we’re doing this
This series puts our Collective Dream Model (CDM) into practice: individual logs in The Dream Drop contribute to a growing, open knowledge base. By releasing standardized monthly aggregates, we:
- lower the barrier to independent validation and secondary analysis,
- support teaching and method development with clean, versioned data,
- preserve a public record of change over time.
How to use & cite
- License: CC BY-NC 4.0 (non-commercial).
- Attribution: “Root Code Collective — Dream Commons (monthly aggregate release)”.
- Citation: Use the Version DOI for the month you analyze. For general pointers, see our Citation Guidelines.
Quick links:
- Concept DOI (always latest): https://doi.org/10.5281/zenodo.17297159
- September 2025 dataset: https://doi.org/10.5281/zenodo.17297160
Privacy & ethics
Releases contain only de-identified aggregates: no personal, raw, or row-level data. Please:
- avoid granular breakdowns that could expose small groups,
- refrain from any re-identification attempts,
- report results with appropriate caveats.
See DATA_USE.txt for the full policy.
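One common way to avoid exposing small groups in your own derived tables is small-cell suppression. The sketch below is only an illustration: the threshold of 10 and the function name are hypothetical and are not taken from DATA_USE.txt, and the example counts are made up.

```python
from typing import Optional

# Hypothetical threshold for illustration; DATA_USE.txt is the authority.
MIN_CELL = 10


def suppress_small_cells(
    counts: dict[str, int], min_cell: int = MIN_CELL
) -> dict[str, Optional[int]]:
    """Replace any count below min_cell with None so it is not reported."""
    return {label: (n if n >= min_cell else None) for label, n in counts.items()}


# Made-up counts, not real release data.
example = {"lucid": 42, "non-lucid": 310, "unsure": 3}
print(suppress_small_cells(example))
# {'lucid': 42, 'non-lucid': 310, 'unsure': None}
```

Note that if a total is published alongside the table, a single suppressed cell can be recovered by subtraction, so in practice you may need to suppress a second cell or omit the total.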
The Monthly Pulse
Each release includes a short pulse summary: the largest positive and negative movements in each domain compared with the previous month. Here’s the format readers will see:
Type +4.9% standard -3.9% positive
Mood +3.1% defeated -3.2% surprise
… (full summary in pulse_movements_YYYY-MM.txt)
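To make the pulse idea concrete, here is a rough Python sketch of how such lines could be derived from two months of per-domain distributions. It is not our pipeline: it assumes the pulse figures are month-over-month differences in percentage points, that each distribution is a mapping from category to share in percent, and the function names and sample numbers are invented for illustration.

```python
def pulse(prev: dict, curr: dict) -> dict:
    """For each domain present in both months, return the category with the
    largest rise and the one with the largest fall, as (label, delta) pairs.

    Deltas are differences of shares in percent, i.e. percentage points.
    A category missing from one month is treated as a share of 0.0.
    """
    out = {}
    for domain, curr_dist in curr.items():
        if domain not in prev:
            continue  # no previous month to compare against
        prev_dist = prev[domain]
        labels = sorted(set(prev_dist) | set(curr_dist))  # sorted for stable ties
        deltas = {
            lab: round(curr_dist.get(lab, 0.0) - prev_dist.get(lab, 0.0), 1)
            for lab in labels
        }
        up = max(labels, key=deltas.get)
        down = min(labels, key=deltas.get)
        out[domain] = ((up, deltas[up]), (down, deltas[down]))
    return out


def format_line(domain: str, up: tuple, down: tuple) -> str:
    """Render one line in the same layout as the sample pulse lines."""
    return f"{domain} {up[1]:+.1f}% {up[0]} {down[1]:+.1f}% {down[0]}"


# Invented shares (percent) for two consecutive months.
prev = {"Mood": {"calm": 40.0, "anxious": 35.0, "joyful": 25.0}}
curr = {"Mood": {"calm": 43.5, "anxious": 30.0, "joyful": 26.5}}

for domain, (up, down) in pulse(prev, curr).items():
    print(format_line(domain, up, down))
# Mood +3.5% calm -5.0% anxious
```

The replication notebook remains the reference for how the published pulse is actually extracted.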
For researchers & educators
- Reproducibility: A notebook (notebooks/replicate_pulse_YYYY-MM.ipynb) demonstrates quick checks, visuals, and pulse extraction.
- Interoperability: CSV + JSON + Parquet, with a clear schema and codebook.
- Provenance: Build hashes and checksums are included for auditability.
If you need a different format or a small convenience export for teaching, email us: contact@rootcodecollective.org.
Roadmap
- Light, human-readable methods preprint describing the pipeline end-to-end.
- Additional derived tables (e.g., longer-horizon deltas) as the series matures.
- Community examples and teaching notebooks.
Get the data
- Latest dataset (concept DOI): https://doi.org/10.5281/zenodo.17297159
- September 2025 (version DOI): https://doi.org/10.5281/zenodo.17297160
Thank you for supporting ethical, open-access dream research. If you build something with the data (a paper, a visualization, a teaching module), tell us! We’d love to feature it.