Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Reset Other
Document_Segmentation
art
Synthetic
medical
code
biology
finance
legal
chemistry
agent
music
climate
Apply filters
Datasets
334
Full-text search
Edit filters
Sort: Trending
Active filters:
wikipedia
Clear all
open-index/open-wikipedia-markdown
Viewer
•
Updated
about 22 hours ago
•
139k
•
857
•
4
OxAISH-AL-LLM/wiki_toxic
Viewer
•
Updated
Sep 19, 2022
•
249k
•
499
•
24
jpwahle/machine-paraphrase-dataset
Viewer
•
Updated
Jun 15, 2025
•
585k
•
281
•
7
rag-datasets/rag-mini-wikipedia
Viewer
•
Updated
Jun 2, 2024
•
4.12k
•
1.7k
•
46
agentlans/wikipedia-paragraphs-complete
Viewer
•
Updated
Aug 21, 2025
•
3.32M
•
79
•
1
open-index/open-wikipedia
Viewer
•
Updated
about 22 hours ago
•
139k
•
806
•
1
Exr0n/wiki-entity-similarity
Viewer
•
Updated
Aug 19, 2022
•
37.8M
•
190
•
10
olm/olm-wikipedia-20220920
Viewer
•
Updated
Oct 18, 2022
•
6.55M
•
208
olm/olm-wikipedia-20220701
Viewer
•
Updated
Oct 18, 2022
•
6.52M
•
86
olm/olm-wikipedia-20221001
Viewer
•
Updated
Oct 18, 2022
•
6.55M
•
199
•
1
dennlinger/wiki-paragraphs
Viewer
•
Updated
Oct 13, 2022
•
12M
•
65
jpwahle/autoencoder-paraphrase-dataset
Viewer
•
Updated
Jun 15, 2025
•
2.62M
•
74
•
2
jpwahle/autoregressive-paraphrase-dataset
Viewer
•
Updated
Nov 19, 2022
•
262k
•
33
•
1
Genius1237/TyDiP
Viewer
•
Updated
Oct 15, 2023
•
4.43k
•
128
statworx/leipzip-swiss
Viewer
•
Updated
Nov 21, 2022
•
600k
•
20
•
2
TUKE-DeutscheTelekom/skquad
Viewer
•
Updated
Dec 5, 2024
•
91.2k
•
137
•
8
RussianNLP/wikiomnia
Updated
Apr 7, 2023
•
194
•
16
olm/olm-wikipedia-20221220
Viewer
•
Updated
Dec 29, 2022
•
6.59M
•
586
•
9
dmargutierrez/Babelscape-wikineural-joined
Viewer
•
Updated
Mar 16, 2023
•
1.03M
•
23
•
1
TUKE-DeutscheTelekom/squad-sk
Viewer
•
Updated
Oct 18, 2023
•
136k
•
22
•
2
cyanic-selkie/aida-conll-yago-wikidata
Viewer
•
Updated
Jun 28, 2023
•
1.39k
•
402
•
9
cyanic-selkie/wikianc-hr
Viewer
•
Updated
Jun 1, 2023
•
2.7M
•
14
•
1
cyanic-selkie/wikianc-en
Viewer
•
Updated
Jun 2, 2023
•
43.1M
•
23
•
1
armvectores/hy_wikipedia_2023
Viewer
•
Updated
Apr 9, 2023
•
297k
•
15
armvectores/hyw_wikipedia_2023
Viewer
•
Updated
Apr 9, 2023
•
10.8k
•
9
•
1
chrisociepa/wikipedia-pl-20230401
Viewer
•
Updated
Apr 17, 2023
•
1.56M
•
123
•
3
mingaflo/rebel-dataset-de
Viewer
•
Updated
Apr 20, 2023
•
399k
•
14
abokbot/wikipedia-first-paragraph
Viewer
•
Updated
Jun 4, 2023
•
6.46M
•
50
•
4
KShivendu/wikipedia-1k-cohere-openai-embeddings
Viewer
•
Updated
Jul 20, 2023
•
1k
•
98
•
2
MichaelR207/MultiSim
Updated
Nov 14, 2023
•
628
•
8
Previous
1
2
3
...
12
Next