About Battery Data Commons
A curated, continuously updated metadata commons for battery datasets, designed as long-term research infrastructure.
Purpose & Scope
The Battery Data Commons serves as the public-facing interface of a curated battery dataset catalogue developed to support transparency, reuse, and comparability in battery research.
The website is not a data repository. It does not host raw datasets, perform data processing, or enforce experimental standardisation. Instead, it provides a metadata commons that makes existing public battery datasets discoverable, comparable, and citable.
The website is designed as long-term research infrastructure for the battery community, rather than as a static project webpage.
Curation Methodology
Curation Over Aggregation
All publicly visible records have undergone human curation. No automatically harvested or user-submitted record appears on the website without manual review.
Metadata, Not Measurements
The Commons indexes structured metadata describing datasets, not the datasets themselves. All data access occurs via persistent external links (e.g., DOIs, repository landing pages).
Versioned Public State
The public website always reflects a released, versioned snapshot of the catalogue. Intermediate states (harvesting, staging, review) are never exposed publicly.
Provenance & Auditability
For every dataset record, the origin of the information (source repository, review process, update history) is explicitly traceable.
Governance
Dual Intake Model
The Commons supports two complementary intake channels:
- Automated discovery: Periodic harvesting of candidate records from public APIs (Zenodo, DataCite, Crossref), used strictly for candidate identification
- Community contribution: Users may propose new datasets, corrections, or updates via an issue-based submission mechanism
Both channels feed into the same curation pipeline, and all records undergo human review before publication.
Canonicalisation & Release
Approved records are mapped to a unified metadata schema and added to the canonical catalogue. Public updates occur only through versioned releases, each representing an immutable snapshot.
How to Cite
When citing the Battery Data Commons, please use:
Battery Data Commons. (2026). Battery Data Commons: A curated metadata catalogue for battery datasets (Version 1.0). https://battery-data-commons.org
When citing a specific snapshot, include the version number. The initial release is v1.0.
Dataset Categories
Datasets are organised into the following categories:
- Performance: Capacity, rate capability, efficiency characterisation
- Ageing: Cycle and calendar degradation studies
- Field: Real-world operational data from deployed systems
- Modelling: Validation data for simulation frameworks
- Safety: Abuse testing, thermal runaway, failure modes
- Diagnostics: EIS, pulse tests, state estimation data
Contact
For questions, suggestions, or collaboration inquiries, please open a discussion on GitHub Discussions.
To submit a new dataset or correction, please visit the Submit page.