Hugging Face
Datasets
16,137
Sort: Trending
Active filters:
1M<n<10M
nick007x/arxiv-papers • Viewer • Updated Oct 14 • 2.55M • 10.5k • 85
InternRobotics/InternData-A1 • Viewer • Updated 2 days ago • 1.84M • 8.57k • 46
allenai/Dolci-Think-SFT-32B • Viewer • Updated 5 days ago • 2.25M • 2.08k • 17
facebook/sam-3d-body-dataset • Viewer • Updated 11 days ago • 5.66M • 2.67k • 28
ILSVRC/imagenet-1k • Viewer • Updated Sep 17 • 1.43M • 61.8k • 613
neulab/agent-data-collection • Preview • Updated 5 days ago • 10.4k • 99
lmsys/lmsys-chat-1m • Viewer • Updated Jul 27, 2024 • 1M • 6.08k • 767
ai4bharat/IndicVoices • Viewer • Updated May 21 • 4.8M • 6.75k • 38
nvidia/Llama-Nemotron-Post-Training-Dataset • Viewer • Updated May 8 • 3.91M • 6.75k • 608
nvidia/OpenMathReasoning • Viewer • Updated May 27 • 5.68M • 13.4k • 361
HuggingFaceTB/smoltalk2 • Viewer • Updated about 1 month ago • 8.61M • 8.89k • 126
ytu-ce-cosmos/Cosmos-Turkish-Corpus-v1.0 • Viewer • Updated about 16 hours ago • 9.08M • 12 • 5
nyu-mll/glue • Viewer • Updated Jan 30, 2024 • 1.49M • 385k • 456
nvidia/Nemotron-Personas-USA • Viewer • Updated Oct 28 • 1M • 6.07k • 228
allenai/Dolci-Instruct-SFT-7B • Viewer • Updated 5 days ago • 2.15M • 1.39k • 9
Wakals/CoVT-Dataset • Viewer • Updated 8 days ago • 1.17M • 1.53k • 5
Salesforce/wikitext • Viewer • Updated Jan 4, 2024 • 3.71M • 965k • 525
JeanKaddour/minipile • Viewer • Updated Jun 20, 2023 • 1.01M • 3.18k • 132
Salesforce/lotsa_data • Viewer • Updated Jan 21 • 3.97M • 42.2k • 91
nkp37/OpenVid-1M • Viewer • Updated Jul 14 • 1.45M • 25.9k • 236
common-pile/caselaw_access_project • Viewer • Updated Jun 6 • 5.52M • 2.75k • 204
DAMO-NLP-SG/multimodal_textbook • Updated Mar 17 • 5.18k • 156
a-m-team/AM-DeepSeek-R1-Distilled-1.4M • Preview • Updated Mar 30 • 1.61k • 167
BytedTsinghua-SIA/DAPO-Math-17k • Viewer • Updated Apr 18 • 1.79M • 8.1k • 122
Agent-Ark/Toucan-1.5M • Viewer • Updated Oct 4 • 1.65M • 11.9k • 180
nvidia/Nemotron-VLM-Dataset-v2 • Viewer • Updated 5 days ago • 4.58M • 10.7k • 67
openbmb/InfLLM-V2-data-5B • Viewer • Updated Oct 25 • 7.19M • 24 • 3
microsoft/ms_marco • Viewer • Updated Jan 4, 2024 • 1.11M • 15.5k • 203
Muennighoff/natural-instructions • Viewer • Updated Dec 23, 2022 • 7.15M • 4.29k • 73
wangrui6/Zhihu-KOL • Viewer • Updated Apr 23, 2023 • 1.01M • 739 • 249