Books from the Survivor Library (mostly ~1920s & earlier) OCR'd with recent VLMs
BEEspoke Data
community
AI & ML interests
'an LLM is only as good as the dataset it was trained on' - Sun Tzu
Recent Activity
smol_llama 220M fine-tunes we did
- BEE-spoke-data/smol_llama-220M-openhermes
  Text Generation • 0.2B • Updated • 590 • 5
- BEE-spoke-data/smol_llama-220M-open_instruct
  Text Generation • 0.2B • Updated • 10 • 2
- BEE-spoke-data/beecoder-220M-python
  Text Generation • 0.2B • Updated • 6 • 3
- BEE-spoke-data/zephyr-220m-sft-full
  Text Generation • 0.2B • Updated • 703 • 1
models fine-tuned to be knowledgeable about apiary practice
- BEE-spoke-data/TinyLlama-3T-1.1bee
  Text Generation • 1B • Updated • 8 • 2
- BEE-spoke-data/TinyLlama-1.1bee
  Text Generation • 1B • Updated • 2 • 1
- BEE-spoke-data/Meta-Llama-3-8Bee
  Text Generation • 8B • Updated • 4
- BEE-spoke-data/phi-1bee5
  Text Generation • 1B • Updated • 2 • 1
trained and adapted tokenizers (various)
🚧 "raw" pretrained smol_llama checkpoints - WIP 🚧
- BEE-spoke-data/smol_llama-101M-GQA
  Text Generation • 0.1B • Updated • 1.93k • 30
- BEE-spoke-data/smol_llama-81M-tied
  Text Generation • 81.3M • Updated • 632 • 9
- BEE-spoke-data/smol_llama-220M-GQA
  Text Generation • 0.2B • Updated • 1.83k • 13
- BEE-spoke-data/verysmol_llama-v11-KIx2
  Text Generation • 58.1M • Updated • 580 • 4
Pretrained encoder (fill-mask) models we made
text classification models for book genres
- BEE-spoke-data/albert-xxlarge-v2-description2genre
  Text Classification • 0.2B • Updated • 4 • 2
- BEE-spoke-data/mobilebert-uncased-title2genre
  Text Classification • 24.6M • Updated • 4 • 1
- BEE-spoke-data/roberta-large-title2genre
  Text Classification • 0.4B • Updated • 1 • 1
- BEE-spoke-data/roberta-base-description2genre
  Text Classification • 0.1B • Updated • 4
concept datasets extracted from fineweb