ByteDance/Dolphin-1.5
Tags: Image-Text-to-Text, Transformers, Safetensors, custom, Chinese, English, vision-encoder-decoder, image-to-text, document-parsing, document-understanding, document-intelligence, ocr, layout-analysis, table-extraction, multimodal, vision-language-model
License: mit
Branch: main
Repository: Dolphin-1.5 (808 MB, 1 contributor)
History: 3 commits
Latest commit: HaoFeng2025, "Update README.md", 8a2c39a (verified), 11 days ago
File                      Size       Badge  Last commit       Updated
.gitattributes            1.52 kB    Safe   initial commit    12 days ago
README.md                 3.51 kB    Safe   Update README.md  11 days ago
config.json               4.85 kB    Safe   Upload 7 files    11 days ago
generation_config.json    160 Bytes  Safe   Upload 7 files    11 days ago
model.safetensors         796 MB     xet    Upload 7 files    11 days ago
preprocessor_config.json  477 Bytes  Safe   Upload 7 files    11 days ago
special_tokens_map.json   277 Bytes  Safe   Upload 7 files    11 days ago
tokenizer.json            7.86 MB    Safe   Upload 7 files    11 days ago
tokenizer_config.json     4.05 MB    Safe   Upload 7 files    11 days ago
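As a quick sanity check, the per-file sizes listed above can be added up and compared with the 808 MB repository total. The sketch below is illustrative only: the helper names `to_bytes` and `total_megabytes` are hypothetical, the sizes are copied as displayed in the listing, and decimal units (1 kB = 1000 bytes) are an assumption about how the page rounds its figures.

```python
# Sizes exactly as shown in the file listing.
LISTED_SIZES = {
    ".gitattributes": "1.52 kB",
    "README.md": "3.51 kB",
    "config.json": "4.85 kB",
    "generation_config.json": "160 Bytes",
    "model.safetensors": "796 MB",
    "preprocessor_config.json": "477 Bytes",
    "special_tokens_map.json": "277 Bytes",
    "tokenizer.json": "7.86 MB",
    "tokenizer_config.json": "4.05 MB",
}

# Assumption: the page uses decimal (SI) units, not binary ones.
UNITS = {"Bytes": 1, "kB": 1000, "MB": 1000 ** 2}


def to_bytes(size: str) -> float:
    """Convert a display string such as '1.52 kB' into a byte count."""
    value, unit = size.split()
    return float(value) * UNITS[unit]


def total_megabytes(sizes: dict) -> float:
    """Sum all listed sizes and express the result in megabytes."""
    return sum(to_bytes(s) for s in sizes.values()) / UNITS["MB"]


if __name__ == "__main__":
    print(f"Sum of listed files: {total_megabytes(LISTED_SIZES):.2f} MB")
```

Under these assumptions the sum rounds to 808 MB, with model.safetensors accounting for nearly all of it. Because the displayed sizes are themselves rounded, the result is only approximate.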