Question 1

What are the key features of HF Dataset?

Accepted Answer

Search and discover datasets on the Hugging Face Hub. Fetch raw data rows from specific dataset splits. Perform full-text search within datasets using BM25. Filter dataset rows using SQL-like syntax. Retrieve dataset statistics and schema information.

Question 2

What can I use HF Dataset for?

Accepted Answer

Quickly exploring dataset contents without downloading large files. Programmatically filtering and searching through public ML datasets. Validating dataset accessibility and viewer feature support. Retrieving direct download links for Parquet files.

Question 3

What tools does HF Dataset provide?

Accepted Answer

search_datasets: Find datasets on the Hugging Face Hub by name, tag, or author.. validate_dataset: Check if a dataset is accessible and which viewer features are available.. list_splits: Get all available configurations and splits for a dataset.. get_dataset_info: Get the schema, metadata, and row counts for a dataset configuration.. get_rows: Fetch a slice of rows from a dataset split.. search_dataset: Full-text search within a dataset split using BM25 ranking.. filter_rows: Filter dataset rows using SQL-like WHERE conditions.. get_dataset_size: Get row counts and byte sizes for all configs and splits.. list_parquet_files: Get URLs for the dataset's Parquet files for direct download or processing.. get_statistics: Get descriptive statistics for each column in a dataset split..

Question 4

How do I install HF Dataset?

Accepted Answer

Install HF Dataset by running: npx @cfahlgren1/hf-dataset-mcp

Question 5

What are the requirements for HF Dataset?

Accepted Answer

HF Dataset requires the following environment variables: HF_TOKEN (optional), HF_DATASETS_SERVER (optional). You'll also need a compatible MCP client like Claude Desktop or Claude Code.

Question 6

Is HF Dataset free to use?

Accepted Answer

Yes, HF Dataset is open source and free to use. You can find the source code on GitHub.

Question 7

What MCP clients support HF Dataset?

Accepted Answer

HF Dataset works with any MCP-compatible client including Claude Desktop (Anthropic's official desktop app), Claude Code (CLI tool), Cursor, and other editors with MCP support.

Question 8

How do I configure HF Dataset?

Accepted Answer

Configure HF Dataset by adding it to your MCP client's config file. The setup block at the top of this page generates a ready-to-paste config for Claude Code, Cursor, Codex, Windsurf, and Claude Desktop.

Variable	Description
`HF_TOKEN`	Hugging Face API token (required for private/gated datasets)
`HF_DATASETS_SERVER`	Custom Dataset Viewer API URL (default: `https://datasets-server.huggingface.co`)

HF Dataset MCP Server

Add it to Claude Code

Make your agent remember this setup

What it does

Tools 10

Environment Variables

Try it

HF Dataset MCP

Installation

Configuration

Claude Desktop

Environment Variables

Tools

`search_datasets`

`validate_dataset`

`list_splits`

`get_dataset_info`

`get_rows`

`search_dataset`

`filter_rows`

`get_dataset_size`

`list_parquet_files`

`get_statistics`

Examples

Find text classification datasets

Get IMDB dataset info

Fetch rows from a dataset

Search for specific content

Filter rows

License

Frequently Asked Questions

Turn this server into reusable context

Add it to Claude Code

Make your agent remember this setup

What it does

Tools 10

Environment Variables

Try it

HF Dataset MCP

Installation

Configuration

Claude Desktop

Environment Variables

Tools

`search_datasets`

`validate_dataset`

`list_splits`

`get_dataset_info`

`get_rows`

`search_dataset`

`filter_rows`

`get_dataset_size`

`list_parquet_files`

`get_statistics`

Examples

Find text classification datasets

Get IMDB dataset info

Fetch rows from a dataset

Search for specific content

Filter rows

License

Frequently Asked Questions

Turn this server into reusable context

Related MCP Servers