About Battery Data Commons

A curated, continuously updated metadata commons for battery datasets, designed as long-term research infrastructure.

Purpose & Scope

The Battery Data Commons serves as the public-facing interface of a curated battery dataset catalogue developed to support transparency, reuse, and comparability in battery research.

The website is not a data repository. It does not host raw datasets, perform data processing, or enforce experimental standardisation. Instead, it provides a metadata commons that makes existing public battery datasets discoverable, comparable, and citable.

The website is designed as long-term research infrastructure for the battery community, rather than as a static project webpage.

Curation Methodology

Curation Over Aggregation

All publicly visible records have undergone human curation. No automatically harvested or user-submitted record appears on the website without manual review.

Metadata, Not Measurements

The Commons indexes structured metadata describing datasets, not the datasets themselves. All data access occurs via persistent external links (e.g., DOIs, repository landing pages).
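As an illustration of what "metadata, not measurements" can look like in practice, the sketch below models a catalogue record that carries only descriptive fields and a pointer to the externally hosted data. The class name, field names, and the example DOI are hypothetical placeholders, not the Commons' actual schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DatasetRecord:
    """Descriptive metadata for one dataset; no measurement data is stored."""
    title: str
    category: str
    doi: Optional[str] = None            # persistent identifier, if one exists
    landing_page: Optional[str] = None   # repository landing page as fallback

    def access_link(self) -> str:
        """Return the external link through which the data is accessed."""
        if self.doi:
            # DOIs resolve through the doi.org resolver.
            return f"https://doi.org/{self.doi}"
        if self.landing_page:
            return self.landing_page
        raise ValueError("record has no persistent external link")


# Placeholder record; the title and DOI are invented for illustration.
example = DatasetRecord(
    title="Example cycling study (placeholder)",
    category="Ageing",
    doi="10.5281/zenodo.0000000",
)
print(example.access_link())
```

A record without a DOI or landing page raises an error in this sketch, reflecting the idea that every entry must point to an external source.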

Versioned Public State

The public website always reflects a released, versioned snapshot of the catalogue. Intermediate states (harvesting, staging, review) are never exposed publicly.

Provenance & Auditability

For every dataset record, the origin of the information (source repository, review process, update history) is explicitly traceable.

Governance

Dual Intake Model

The Commons supports two complementary intake channels:

  • Automated discovery: Periodic harvesting from public APIs (Zenodo, DataCite, Crossref). Harvested records are treated strictly as candidates for review, never as publishable entries.
  • Community contribution: Users may propose new datasets, corrections, or updates via an issue-based submission mechanism.

Both channels feed into the same curation pipeline, and all records undergo human review before publication.
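The dual intake model can be sketched as two entry points that produce the same kind of candidate, with a single review step as the only route to publication. All names below (Channel, Status, Candidate, review, publishable) are hypothetical and chosen for illustration; the real pipeline may be organised differently.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Channel(Enum):
    AUTOMATED_DISCOVERY = "automated discovery"
    COMMUNITY_CONTRIBUTION = "community contribution"


class Status(Enum):
    CANDIDATE = "candidate"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Candidate:
    identifier: str                     # e.g. a DOI (placeholder values below)
    channel: Channel
    status: Status = Status.CANDIDATE   # every intake starts unreviewed
    reviewer: Optional[str] = None


def review(candidate: Candidate, reviewer: str, approve: bool) -> Candidate:
    """Record a human decision; a named reviewer is mandatory."""
    if not reviewer:
        raise ValueError("a human reviewer must be named")
    candidate.status = Status.APPROVED if approve else Status.REJECTED
    candidate.reviewer = reviewer
    return candidate


def publishable(candidates: List[Candidate]) -> List[Candidate]:
    """Only reviewed and approved candidates may be published."""
    return [
        c for c in candidates
        if c.status is Status.APPROVED and c.reviewer is not None
    ]


harvested = Candidate("10.0000/example-a", Channel.AUTOMATED_DISCOVERY)
submitted = Candidate("10.0000/example-b", Channel.COMMUNITY_CONTRIBUTION)
review(submitted, reviewer="curator-1", approve=True)
print([c.identifier for c in publishable([harvested, submitted])])
```

In this sketch the intake channel has no influence on whether a record is publishable; only the review outcome does.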

Canonicalisation & Release

Approved records are mapped to a unified metadata schema and added to the canonical catalogue. Public updates occur only through versioned releases, each representing an immutable snapshot.
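One possible way to picture canonicalisation and release is shown below: source-specific field names are mapped onto a single schema, and each release is a new read-only snapshot that leaves earlier ones untouched. The source names, field mappings, and the version string "1.1" are invented for illustration and do not describe the actual schema or release history.

```python
from dataclasses import dataclass
from types import MappingProxyType
from typing import Dict, List, Mapping, Tuple

# Hypothetical per-source mappings: source field name -> canonical field name.
FIELD_MAPS: Dict[str, Dict[str, str]] = {
    "source_a": {"name": "title", "identifier": "doi"},
    "source_b": {"dataset_title": "title", "doi_string": "doi"},
}


def canonicalise(raw: Dict[str, str], source: str) -> Mapping[str, str]:
    """Map a source-specific record onto the unified schema (read-only)."""
    mapping = FIELD_MAPS[source]
    canonical = {target: raw[field] for field, target in mapping.items()}
    canonical["source"] = source  # keep the origin for provenance
    return MappingProxyType(canonical)


@dataclass(frozen=True)
class Release:
    """An immutable, versioned snapshot of the catalogue."""
    version: str
    records: Tuple[Mapping[str, str], ...]


def publish(previous: Release, approved: List[Mapping[str, str]], version: str) -> Release:
    """Create a new snapshot; the previous release is not modified."""
    return Release(version=version, records=previous.records + tuple(approved))


v1 = Release(version="1.0", records=())
record = canonicalise({"name": "Example dataset", "identifier": "10.0000/example"}, "source_a")
v2 = publish(v1, [record], version="1.1")
print(v2.version, len(v1.records), len(v2.records))
```

Using a frozen dataclass and a read-only mapping means that accidental modification of a published snapshot fails loudly rather than silently changing the public state.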

How to Cite

When citing the Battery Data Commons, please use:

Battery Data Commons. (2026). Battery Data Commons: A curated metadata catalogue for battery datasets (Version 1.0). https://battery-data-commons.org

When citing a specific snapshot, include the version number. The initial release is v1.0.
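For scripted workflows, the citation can be generated with the version filled in. This is a minimal sketch: the function name cite is hypothetical, and the year to use for releases after v1.0 is an assumption the caller must supply.

```python
def cite(version: str = "1.0", year: int = 2026) -> str:
    """Build the recommended citation for a given catalogue version."""
    return (
        f"Battery Data Commons. ({year}). "
        "Battery Data Commons: A curated metadata catalogue for battery datasets "
        f"(Version {version}). https://battery-data-commons.org"
    )


print(cite())
```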

Dataset Categories

Datasets are organised into the following categories:

  • Performance: Capacity, rate capability, efficiency characterisation
  • Ageing: Cycle and calendar degradation studies
  • Field: Real-world operational data from deployed systems
  • Modelling: Validation data for simulation frameworks
  • Safety: Abuse testing, thermal runaway, failure modes
  • Diagnostics: EIS, pulse tests, state estimation data
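
Tools that consume the catalogue may find it convenient to treat the categories as a closed set. The sketch below encodes the six categories listed above as an enumeration; the names Category and parse_category are hypothetical and not part of any published interface.

```python
from enum import Enum


class Category(Enum):
    PERFORMANCE = "Performance"
    AGEING = "Ageing"
    FIELD = "Field"
    MODELLING = "Modelling"
    SAFETY = "Safety"
    DIAGNOSTICS = "Diagnostics"


def parse_category(label: str) -> Category:
    """Match a label to a category, ignoring case and surrounding spaces."""
    normalised = label.strip().lower()
    for category in Category:
        if category.value.lower() == normalised:
            return category
    raise ValueError(f"unknown category: {label!r}")


print(parse_category(" ageing ").value)
```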

Contact

For questions, suggestions, or collaboration enquiries, please start a thread in GitHub Discussions.

To submit a new dataset or correction, please visit the Submit page.