Linear Datamodeling Score (LDS)¶

The LinearDatamodelingMetric is a ground-truth metric (Park et al., 2023): it measures how well an attribution method predicts the actual change in model output when retraining on different subsets of the training data. The corresponding LinearDatamodeling benchmark wraps this end-to-end — including training the counterfactual subset models — and is the most expensive benchmark in quanda. This page collects the practical caveats you should know before running it.

Caveats¶

1. ``load_pretrained`` downloads M subset checkpoints. LDS retrains the model on M random subsets of the training data (default m=100 in the published configs). LinearDatamodeling.load_pretrained(...) therefore pulls down M+1 checkpoints from the Hugging Face Hub (the main model plus every subset). For the mnist_linear_datamodeling / cifar_linear_datamodeling / awa2_linear_datamodeling / qnli_linear_datamodeling benchmarks this is 100 subset checkpoints; expect a sizeable download and disk footprint on first use.

2. Counterfactual subset logits should be precomputed once and reused. During evaluation, each subset model is run over the eval dataset to produce counterfactual logits, which are then correlated with the explainer’s group attributions. These per-subset logits depend only on the subset checkpoints and the eval subsample — they do not depend on the explainer being evaluated. Recomputing them inside every evaluate(...) call is wasteful, so LinearDatamodeling exposes cache_subset_logits that runs the M forward passes once and writes them to disk; subsequent evaluate(...) calls pass that directory via subset_logits_dir=... and skip the recomputation.

To parallelize computations, you can use cache_subset_logits_per_idx for a single subset index, so the M forward passes can be split across workers.

3. Training subset models from scratch. Calling LinearDatamodeling.train(config) trains the main model and then iterates through all M subset models sequentially in the same process — fine for small benchmarks, but you will usually want to parallelize for larger runs.

The recommended pattern is to split training into two phases. train(config, skip_subsets=True) trains and persists the main model and writes the metadata (split ids, plus the subset-id file whose name is set by the subset_ids field in the config — lds_subsets.yaml in the shipped configs) but does not train any subset models. Then train_subset(config, idx=...) rebuilds the benchmark from that metadata, trains a single subset idx, and persists its checkpoint. train_subset is designed to be the unit of work for an array job — call it once per worker / GPU / SLURM task and the M subsets train in parallel rather than back-to-back. The passed config must contain a bench_save_dir field, which is used to save the main model, the subset checkpoints, and the metadata that links them together.

LinearDatamodeling.train(
    lds_train_config,
    device=device,
    skip_subsets=True,
)

for idx in range(3):
    LinearDatamodeling.train_subset(
        lds_train_config,
        idx=idx,
        device=device,
    )

The companion scripts/train_lds_subset.py wraps train_subset as a CLI entry point that takes a single --idx, intended for SLURM array jobs.

Both methods accept either a config dict, a registered bench_id, or a path to a benchmark YAML.

Precomputing and reusing subset logits¶

The example below loads the published mnist_linear_datamodeling benchmark via load_pretrained, then populates the subset-logits and explanations caches before calling evaluate. The same subset_logits_dir can be passed to every evaluate(...) call regardless of explainer; the same explanations cache_dir + use_cached_expl=True can be reused whenever the explainer / expl_kwargs / eval-subsample match.

lds_config = "mnist_linear_datamodeling_unit"

lds_bench = LinearDatamodeling.load_pretrained(
    bench_id=lds_config,
    cache_dir=lds_cache,
    device=device,
)

expl_kwargs = {
    "model_id": "mnist_lds_tutorial",
    "layers": "fc_2",
    "cache_dir": os.path.join(cache_dir, "lds_captum_similarity"),
}
expl_cache_dir = os.path.join(cache_dir, "lds_explanations")

LinearDatamodeling.explain(
    config=lds_config,
    explainer_cls=CaptumSimilarity,
    expl_kwargs=expl_kwargs,
    cache_dir=expl_cache_dir,
    device=device,
    batch_size=8,
)

subset_logits_dir = LinearDatamodeling.cache_subset_logits(
    config=lds_config,
    cache_dir=os.path.join(cache_dir, "lds_subset_logits"),
    device=device,
    batch_size=8,
)

lds_results = lds_bench.evaluate(
    explainer_cls=CaptumSimilarity,
    expl_kwargs=expl_kwargs,
    cache_dir=expl_cache_dir,
    use_cached_expl=True,
    subset_logits_dir=subset_logits_dir,
    batch_size=8,
)
print(f"Linear Datamodeling Score: {lds_results['score']}")

Likewise, call explain once per explainer and reuse the returned cache directory across re-evaluations or across sibling benchmarks that share the same model + train/eval datasets via a common explanations_group in the YAML.

All of the classmethods on this page (load_pretrained, train, train_subset, explain, cache_subset_logits, cache_subset_logits_per_idx) accept bench_id / config either as a registered string (e.g. "mnist_linear_datamodeling"), a path to a benchmark YAML, or a config dict.