VoucherVision API

Select one or two OCR engines below. Each engine has different capabilities and performance characteristics.

About the models:

Gemini 2.5 Pro: Higher accuracy, especially for handwritten text. Can follow complex instructions, such as parsing multiple species determinations, and can perform geolocation, search the internet to validate content, etc.
Gemini 2.0 Flash: Fast processing with good accuracy for most specimens.
Gemini 1.5 Pro: Better at handwriting than Gemini 2.0 Flash, but slower and more expensive, and it will be deprecated in 2025.
Gemini 2.5 Flash: A bit disappointing at the moment. Handwriting performance seems worse than that of Gemini 2.0 Flash.
OCR Only: Extract text only (check box below)

OCR Engines (can select one or multiple):

LLM Model for creating JSON:

Please use gemini-2.0-flash by default, and reserve gemini-2.5-pro for prompts that are optimized to take full advantage of its capabilities (e.g. SLTPvM_geolocate_flag_multispecimen.yaml).

SLTPvM_geolocate_flag_multispecimen.yaml flags sheets that contain more than one specimen (multiple barcodes) and automatically geolocates specimens whose label text lacks GPS coordinates.
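
The model recommendation above can be expressed as a small lookup. This is only a sketch: the helper name choose_llm_model and the set PRO_OPTIMIZED_PROMPTS are hypothetical and not part of VoucherVision; only the model names and the prompt file name come from the text.

```python
# Hypothetical helper, not part of the VoucherVision API.
# Prompts listed here are assumed to be the ones optimized for gemini-2.5-pro.
PRO_OPTIMIZED_PROMPTS = {"SLTPvM_geolocate_flag_multispecimen.yaml"}


def choose_llm_model(prompt_name: str) -> str:
    """Return the recommended model name for a given prompt template."""
    if prompt_name in PRO_OPTIMIZED_PROMPTS:
        return "gemini-2.5-pro"
    # Default recommendation for all other prompts.
    return "gemini-2.0-flash"
```

Any prompt not in the set falls back to gemini-2.0-flash, which matches the "use by default" guidance.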

World Flora Online (WFO) Validation:

Enable taxonomic validation against the World Flora Online database for plant specimens. This adds botanical name verification and taxonomic information to the results.

url	description	category
https://example.com/image1.jpg	Specimen 1	Category A
https://example.com/image2.jpg	Specimen 2	Category B
https://example.com/image3.jpg	Specimen 3	Category C
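
A table like the one above can be read with Python's standard csv module before submitting the URLs for processing. This is a minimal sketch that assumes the file is tab-separated with the header row url, description, category, as in the example; the function name read_url_table is hypothetical, and rows without a URL are skipped as an assumption rather than documented behavior.

```python
import csv
import io

# Sample content copied from the example table (tab-separated).
SAMPLE = (
    "url\tdescription\tcategory\n"
    "https://example.com/image1.jpg\tSpecimen 1\tCategory A\n"
    "https://example.com/image2.jpg\tSpecimen 2\tCategory B\n"
    "https://example.com/image3.jpg\tSpecimen 3\tCategory C\n"
)


def read_url_table(text: str) -> list[dict[str, str]]:
    """Parse tab-separated text into a list of row dictionaries."""
    reader = csv.DictReader(io.StringIO(text, newline=""), delimiter="\t")
    rows = []
    for row in reader:
        url = (row.get("url") or "").strip()
        if not url:
            continue  # assumption: a row without a URL cannot be processed
        rows.append(
            {
                "url": url,
                "description": (row.get("description") or "").strip(),
                "category": (row.get("category") or "").strip(),
            }
        )
    return rows


if __name__ == "__main__":
    for entry in read_url_table(SAMPLE):
        print(entry["url"], entry["description"], entry["category"])
```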

VoucherVisionGO API

API Settings

Prompt Template:

Test with File Upload

Test with Image URL

Batch Process URLs

Batch Process Local Images

🗺️ Specimen Location Map (Last Processed)

Debug Information