Skip to main content

Overview

Chat with Images uses AI vision to analyze images, extract text (OCR), describe visual content, and answer questions about what’s in the image.

Supported Formats

FormatExtensions
JPEG.jpg, .jpeg
PNG.png
GIF.gif
WebP.webp
BMP.bmp
TIFF.tiff
SVG.svg
HEIC.heic

How to Use (Web App)

1

Select Tool

Go to Chat with Images in your Skimming AI dashboard.
2

Upload Image

Upload an image file (screenshot, photo, scanned document, infographic, etc.).
3

Get Analysis

Skimming AI analyzes the image and extracts any text using OCR.
4

Ask Questions

Ask about the image content: “What does this chart show?” or “Summarize this infographic.”

How to Use (API)

Step 1: Ingest the Image

curl -X POST https://api.skimming.ai/ingest/v1/api/image \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@screenshot.png"

Step 2: Chat with the Image

curl -X POST https://api.skimming.ai/chat/v1/api/image/png \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "file_id": "YOUR_FILE_ID",
    "question": "What text is in this image?",
    "streaming": false
  }'

Capabilities

FeatureDescription
OCRExtract text from images, scanned documents, screenshots
DescriptionGet a detailed description of visual content
Q&AAsk specific questions about the image
Charts/GraphsInterpret data visualizations

Limits

PlanFile Size
FreeUp to 50 MB
PaidUp to 500 MB

Use Cases

Document Scanning

Extract text from scanned documents and receipts.

Data Analysis

Interpret charts, graphs, and infographics.

Accessibility

Generate descriptions for visual content.

Research

Analyze diagrams, screenshots, and visual data.