VoucherVisionGO API

API Settings

Select one or two OCR engines below. Each engine has different capabilities and performance characteristics.

About the models:

  • Gemini 2.5 Pro: Higher accuracy, especially for handwritten text, can follow complex instructions, like parsing multiple species determinations. Can perform geolocation, search the internet to validate content, etc.
  • Gemini 2.0 Flash: Fast processing with good accuracy for most specimens
  • Gemini 1.5 Pro: Better at handwriting than Gemini 2.0 Flash, but will be deprecated in 2025, slower and more expensive than Gemini 2.0 Flash.
  • Gemini 2.5 Flash: A bit disappointing at the moment. Handwriting performance seems worse than Gemini 2.0 Flash
  • OCR Only: Extract text only (check box below)

OCR Engines (can select one or multiple):

LLM Model for creating JSON:

Please use gemini-2.0-flash and reserve gemini-2.5-pro for the prompts that are optimized to take full advantage of its capabilities (e.g. SLTPvM_geolocate_flag_multispecimen.yaml).

SLTPvM_geolocate_flag_multispecimen.yaml will flag sheets that have more than one specimen (barcodes) and will automatically geolocate specimens that lack GPS coordinates in the label text

Prompt Template:

File Upload
Image URL
Batch URLs
Batch Folder

Test with File Upload

Image Preview

Test with Image URL

Batch Process URLs

Upload a text file (.txt) with one URL per line or a CSV file (.csv) with URLs in a column.

Example CSV file format:

url description category
https://example.com/image1.jpg Specimen 1 Category A
https://example.com/image2.jpg Specimen 2 Category B
https://example.com/image3.jpg Specimen 3 Category C
8

Batch Process Local Images

Drop image files here or

8

🗺️ Specimen Location Map (Last Processed)

⚠️ Waiting for results with coordinates...

Debug Information