API Settings
Select one or two OCR engines below. Each engine has different capabilities and performance characteristics.
About the models:
- Gemini 2.5 Pro: Higher accuracy, especially for handwritten text. Can follow complex instructions, such as parsing multiple species determinations, and can also geolocate specimens and search the internet to validate content.
- Gemini 2.0 Flash: Fast processing with good accuracy for most specimens.
- Gemini 1.5 Pro: Better at handwriting than Gemini 2.0 Flash, but slower and more expensive. It will be deprecated in 2025.
- Gemini 2.5 Flash: A bit disappointing at the moment. Handwriting performance seems worse than with Gemini 2.0 Flash.
- OCR Only: Extract text only (check box below)
OCR Engines (can select one or multiple):
LLM Model for creating JSON:
Please use gemini-2.0-flash by default, and reserve gemini-2.5-pro for prompts that are optimized to take full advantage of its capabilities (e.g. SLTPvM_geolocate_flag_multispecimen.yaml).
SLTPvM_geolocate_flag_multispecimen.yaml flags sheets that have more than one specimen (barcode) and automatically geolocates specimens whose label text lacks GPS coordinates.
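The recommendation above amounts to a simple selection rule, which could be sketched roughly as follows. This is only an illustration and not the application's own code: the names `pick_model`, `DEFAULT_MODEL`, `ADVANCED_MODEL`, and `ADVANCED_PROMPTS` are hypothetical, and the set of "optimized" prompts is assumed to contain only the one template named above.

```python
# Model identifiers as written in the guidance above.
DEFAULT_MODEL = "gemini-2.0-flash"
ADVANCED_MODEL = "gemini-2.5-pro"

# Hypothetical registry of prompt templates written to exploit gemini-2.5-pro.
# Only the template named in the guidance is listed; others are an assumption.
ADVANCED_PROMPTS = {"SLTPvM_geolocate_flag_multispecimen.yaml"}


def pick_model(prompt_template: str) -> str:
    """Return the suggested LLM for a given prompt template file name."""
    if prompt_template in ADVANCED_PROMPTS:
        return ADVANCED_MODEL
    return DEFAULT_MODEL


if __name__ == "__main__":
    print(pick_model("SLTPvM_geolocate_flag_multispecimen.yaml"))
    print(pick_model("some_other_prompt.yaml"))  # hypothetical template name
```

Keeping the list of optimized prompts in one place means the default stays cheap and fast, and the more expensive model is used only where a prompt is known to need it.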
Prompt Template:
File Upload
Image URL
Batch URLs
Batch Folder
Test with File Upload
Test with Image URL
Batch Process URLs
Upload a text file (.txt) with one URL per line or a CSV file (.csv) with URLs in a column.
Example CSV file format:
| url | description | category |
|---|---|---|
| https://example.com/image1.jpg | Specimen 1 | Category A |
| https://example.com/image2.jpg | Specimen 2 | Category B |
| https://example.com/image3.jpg | Specimen 3 | Category C |
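To check an upload file before submitting it, the two accepted formats can be read with a short script like the one below. This is a minimal sketch, not the application's own parser: the function name `extract_urls` is hypothetical, and it assumes the CSV is comma-separated with a header row whose URL column is named `url`, as in the example above.

```python
import csv
import io


def extract_urls(filename: str, text: str, url_column: str = "url") -> list[str]:
    """Return the URLs found in the contents of a .txt or .csv batch file.

    Assumes a comma-separated CSV with a header row; `url_column` is the
    header of the column that holds the URLs.
    """
    if filename.lower().endswith(".csv"):
        reader = csv.DictReader(io.StringIO(text))
        urls = []
        for row in reader:
            value = (row.get(url_column) or "").strip()
            if value:  # skip rows with an empty or missing URL cell
                urls.append(value)
        return urls
    # Plain text: one URL per line, blank lines ignored.
    return [line.strip() for line in text.splitlines() if line.strip()]


if __name__ == "__main__":
    sample_csv = (
        "url,description,category\n"
        "https://example.com/image1.jpg,Specimen 1,Category A\n"
        "https://example.com/image2.jpg,Specimen 2,Category B\n"
    )
    for url in extract_urls("batch.csv", sample_csv):
        print(url)
```

Extra columns such as `description` and `category` are simply ignored here; whether the application itself uses them is not stated above.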
Batch Process Local Images
Drop image files here
🗺️ Specimen Location Map (Last Processed)
⚠️ Waiting for results with coordinates...