> ## Documentation Index
> Fetch the complete documentation index at: https://docs.eigenai.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Datasets

> Off-the-shelf datasets generated by EigenData-CLI for agent evaluation and training, spanning diverse domains and task complexities.

EigenData-CLI generates high-quality datasets for agent evaluation and training. Below is a catalog of off-the-shelf datasets spanning different domains and task complexities. Each comes with a free, ready-to-use **demo sample**; where a full production corpus is available, the dataset page describes the complete dataset and how to license it.

Each dataset includes:

* **Environment** — the simulated world state (MCP server snapshots, databases, or filesystems) that the agent operates in
* **Data** — generated samples including intents, datapoints, evaluators, and reference payloads

## Available Datasets

<CardGroup cols={2}>
  <Card title="APEX Agent" icon="building-columns" href="/products/eigendata-cli/datasets/apex-agent/overview">
    Professional knowledge work across investment banking, law, and management consulting — synthesized from scratch inspired by the APEX benchmark.
  </Card>

  <Card title="Personal Agent Bench" icon="laptop" href="/products/eigendata-cli/datasets/personal-agent-bench/overview">
    Long-horizon personal knowledge-work on a simulated laptop — tax packets, federal returns, reimbursements, and subscription audits across an 8-app environment.
  </Card>

  <Card title="Tau2-Bench" icon="headset" href="/products/eigendata-cli/datasets/tau2-bench/overview">
    Multi-turn, policy-grounded customer-service dialogs across airline, telecom, and retail, with tool use and machine-checkable success criteria.
  </Card>

  <Card title="Tau3-Bench" icon="lock" href="/products/eigendata-cli/datasets/tau3-bench/overview">
    Hard, single-domain retail-banking dialogs with dynamically discoverable tools — the agent must search a knowledge base and unlock the right tool at runtime.
  </Card>

  <Card title="Enterprise Bench" icon="building" href="/products/eigendata-cli/datasets/enterprise-bench/overview">
    Long-horizon agent tasks inside realistic simulated companies — operate the business or answer questions across up to \~40 connected SaaS systems sharing one world state.
  </Card>

  <Card title="WildClawBench" icon="terminal" href="/products/eigendata-cli/datasets/wildclaw-bench/overview">
    Agentic, tool-using tasks across six capability categories — from PDF parsing to code debugging to safety alignment — built on InternLM's WildClawBench.
  </Card>

  <Card title="MCP-Atlas" icon="diagram-project" href="/products/eigendata-cli/datasets/mcp-atlas/overview">
    Multi-step, multi-server tool-use tasks over a \~40-server MCP graph — each frozen with a claims-based reward and a replayable environment snapshot. Built on the MCP-Atlas benchmark.
  </Card>

  <Card title="MCPMark" icon="check-double" href="/products/eigendata-cli/datasets/mcp-mark/overview">
    Synthetic, agentic filesystem + GitHub tasks with deterministic Python verifiers — repo archaeology, cross-file joins, and stateful MCP actions, runnable fully offline.
  </Card>

  <Card title="Toolathlon" icon="medal" href="/products/eigendata-cli/datasets/toolathlon/overview">
    Single-turn, tool-using tasks over a shared multi-application MCP workspace — 32 tool servers, 102 task families, 4,300 RL environments with deterministic grading.
  </Card>

  <Card title="Google Workspace" icon="google" href="/products/eigendata-cli/datasets/google-workspace">
    Everyday Google Workspace tasks — managing emails, calendars, sheets, and contacts across diverse personal and professional scenarios.
  </Card>
</CardGroup>

## Download

The free demo samples are hosted on Hugging Face:

```bash theme={null}
# Download everything
hf download jindidi/eigendata-demo-data --repo-type dataset

# Download a specific dataset
hf download jindidi/eigendata-demo-data --repo-type dataset --include "tau2_bench/*"
```

<Card title="Browse on Hugging Face" icon="face-smile" href="https://huggingface.co/datasets/jindidi/eigendata-demo-data">
  View and download all demo samples
</Card>

## License

The demo samples are released under [CC BY-NC-ND 4.0](https://creativecommons.org/licenses/by-nc-nd/4.0/).

* For demonstration and evaluation purposes only
* No commercial use
* No redistribution or derivative works
* No use for model training

Full dataset corpora are available for commercial licensing, including model training — see each dataset's page or contact [support@eigenai.com](mailto:support@eigenai.com).
