Datalab supports the following file types for document conversion:
PDF
| Extension | MIME Type |
|---|---|
| .pdf | application/pdf |
Spreadsheets
| Extension | MIME Type |
|---|---|
| .xls | application/vnd.ms-excel |
| .xlsx | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet |
| .xlsm | application/vnd.ms-excel.sheet.macroEnabled.12 |
| .xltx | application/vnd.openxmlformats-officedocument.spreadsheetml.template |
| .csv | text/csv |
| .ods | application/vnd.oasis.opendocument.spreadsheet |
Word Documents
| Extension | MIME Type |
|---|---|
| .doc | application/msword |
| .docx | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
| .odt | application/vnd.oasis.opendocument.text |
Presentations
| Extension | MIME Type |
|---|---|
| .ppt | application/vnd.ms-powerpoint |
| .pptx | application/vnd.openxmlformats-officedocument.presentationml.presentation |
| .odp | application/vnd.oasis.opendocument.presentation |
HTML
| Extension | MIME Type |
|---|---|
| .html | text/html |
Ebooks
| Extension | MIME Type |
|---|---|
| .epub | application/epub+zip |
Images
| Extension | MIME Type |
|---|---|
| .png | image/png |
| .jpg | image/jpeg |
| .jpeg | image/jpeg |
| .webp | image/webp |
| .gif | image/gif |
| .tiff | image/tiff |
Detecting MIME Types
To automatically detect a file’s MIME type in Python:
import filetype

# guess() inspects the file's leading bytes and returns None if the type is not recognized.
kind = filetype.guess("document.pdf")
if kind:
    print(kind.mime)  # application/pdf
Install the library with `pip install filetype`.
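Content-based detection relies on file signatures, so it may return nothing for plain-text formats such as .csv or .html. As a fallback, the sketch below maps file extensions to the MIME types listed in the tables above. The names `SUPPORTED_MIME_TYPES` and `mime_from_extension` are illustrative assumptions and are not part of any Datalab SDK.

```python
from pathlib import Path
from typing import Optional

# Extension-to-MIME map copied from the tables on this page.
# The dictionary name is hypothetical, chosen for this example only.
SUPPORTED_MIME_TYPES = {
    ".pdf": "application/pdf",
    ".xls": "application/vnd.ms-excel",
    ".xlsx": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
    ".xlsm": "application/vnd.ms-excel.sheet.macroEnabled.12",
    ".xltx": "application/vnd.openxmlformats-officedocument.spreadsheetml.template",
    ".csv": "text/csv",
    ".ods": "application/vnd.oasis.opendocument.spreadsheet",
    ".doc": "application/msword",
    ".docx": "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
    ".odt": "application/vnd.oasis.opendocument.text",
    ".ppt": "application/vnd.ms-powerpoint",
    ".pptx": "application/vnd.openxmlformats-officedocument.presentationml.presentation",
    ".odp": "application/vnd.oasis.opendocument.presentation",
    ".html": "text/html",
    ".epub": "application/epub+zip",
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".webp": "image/webp",
    ".gif": "image/gif",
    ".tiff": "image/tiff",
}


def mime_from_extension(path: str) -> Optional[str]:
    """Return the MIME type for a listed extension, or None if it is not listed."""
    # Lowercase the suffix so "REPORT.CSV" and "report.csv" behave the same.
    return SUPPORTED_MIME_TYPES.get(Path(path).suffix.lower())


print(mime_from_extension("report.CSV"))   # text/csv
print(mime_from_extension("archive.zip"))  # None
```

An extension lookup trusts the file name rather than the file contents, so it works best as a second step after content-based detection returns no result.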
Size Limits
See API Limits for file size and page limits.
Next Steps
Quickstart
Get started converting documents in minutes.
Document Conversion
Detailed guide to converting documents to Markdown, HTML, or JSON.
API Limits
Understand file size limits, page limits, and rate limiting.
File Upload
Upload files to Datalab storage for use in pipelines.