Iconclass Classification Pipeline

A reproducible pipeline for classifying digital objects with Iconclass codes using Vision-Language Models (VLMs). Supports both local (Ollama) and cloud-based (OpenRouter) backends.

Overview

This project implements an automated pipeline for classifying artwork images with Iconclass codes. The pipeline downloads images, processes them, classifies them using VLMs, and writes the results back to structured metadata files with full provenance tracking.

Features

  • 🎨 Multiple Backends: Local (Ollama Iconclass VLM) or Cloud (OpenRouter Qwen3-VL)
  • 📦 Batch Processing: Process entire collections from metadata.json files
  • 🔄 Image Processing: Automatic download, resize, and normalization of images
  • 💾 Smart Caching: SHA256-based deduplication of downloaded images
  • 📊 Dual Output: Compact codes in metadata + detailed classification records
  • 🔍 Full Provenance: Timestamped runs with complete audit trail
  • 🛡️ Robust: Retry logic, error handling, and comprehensive logging
  • 📝 Type-Safe: Built with Pydantic models for data validation

Installation

Prerequisites

For Ollama backend:

  • Python ≥ 3.11

  • Ollama running locally

  • The Iconclass VLM model pulled in Ollama:

    ollama pull hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M

For OpenRouter backend:

  • Python ≥ 3.11

  • An OpenRouter API key (passed via --api-key or the OPENROUTER_API_KEY environment variable)

Setup (with uv)

  1. Clone the repository:

    git clone https://github.com/Stadt-Geschichte-Basel/iconclass-classification.git
    cd iconclass-classification
  2. Install uv if needed:

    curl -LsSf https://astral.sh/uv/install.sh | sh
    # or: pip install uv

  3. Sync dependencies (creates and populates .venv):

    uv sync
    # (Optional) include dev tools (ruff, ty, pytest)
    uv sync --group dev

  4. Verify the CLI is available:

    uv run iconclass-classification --help

Usage

Important: Data Filtering

The pipeline automatically filters the data to process only child objects:

  • ✅ Processes: Objects with m prefix (children/individual objects)
  • ❌ Excludes: Objects with abb prefix (parents/aggregates)

This filtering happens automatically before sampling. You don't need to pre-filter your data.
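
For illustration, a minimal Python sketch of this filtering rule (the objectid field name follows the metadata examples later in this README; the actual implementation may differ):

def is_child(record: dict) -> bool:
    """Keep "m"-prefixed children; skip "abb"-prefixed parents/aggregates."""
    return str(record.get("objectid", "")).startswith("m")

# Example: only m10039 survives the filter
records = [{"objectid": "m10039"}, {"objectid": "abb10039"}]
print([r["objectid"] for r in records if is_child(r)])  # ['m10039']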

Basic Usage (Ollama)

Classify images from a metadata.json URL (processes all child objects):

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M

Sampling Modes

Fixed Sampling (specific objects)

Process only specific objects listed in a file:

# Create file with object IDs (one per line)
echo "m10039" > my_objects.txt
echo "m10040" >> my_objects.txt

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --sampling-mode fixed \
  --fixed-ids-file my_objects.txt
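
Random Sampling (reproducible subsets)

Process a random sample of child objects; --sampling-seed (default: 42) makes the selection reproducible:

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --sampling-mode random \
  --sampling-size 10 \
  --sampling-seed 42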

Full Dataset

Process all child objects (the default if no sampling mode is specified):

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --sampling-mode full

Prompt Templates

The pipeline includes three prompt templates optimized for different scenarios:

Default (fastest, good for testing)

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --prompt-template default \
  --sampling-mode random --sampling-size 10
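
Instruction (explicit task guidance)

Uses a more detailed, instruction-style prompt; the same template is also available on the OpenRouter backend:

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --prompt-template instruction \
  --sampling-mode random --sampling-size 10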

Few-Shot (best for complex images)

Includes example classifications to guide the model:

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --prompt-template few_shot \
  --sampling-mode random --sampling-size 10

OpenRouter Backend (Cloud-based)

Basic Usage

To use OpenRouter with Qwen3-VL, you need an API key:

# Set your API key as environment variable
export OPENROUTER_API_KEY=your_key_here

# Run classification
uv run iconclass-classification classify-openrouter \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model qwen/qwen3-vl-235b-a22b-instruct \
  --sampling-mode random --sampling-size 10
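
For illustration, this is roughly the kind of request the OpenRouter backend sends under the hood: an OpenAI-compatible chat completion with the image inlined as a base64 data URL. The sketch below is an assumption about the mechanics, not the project's actual client code:

import base64
import os
from pathlib import Path

import requests

def classify_image_openrouter(image_path: Path, prompt: str,
                              model: str = "qwen/qwen3-vl-235b-a22b-instruct") -> str:
    """Send one image as a base64 data URL to OpenRouter's chat completions API."""
    image_b64 = base64.b64encode(image_path.read_bytes()).decode("ascii")
    response = requests.post(
        "https://openrouter.ai/api/v1/chat/completions",
        headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
        json={
            "model": model,
            "messages": [{
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url",
                     "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
                ],
            }],
            "temperature": 0.0,
        },
        timeout=300,
    )
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]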

With Different Models

OpenRouter supports various vision models. The --model flag accepts any OpenRouter model identifier; the example below passes the default, Qwen3-VL 235B, explicitly:

uv run iconclass-classification classify-openrouter \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model qwen/qwen3-vl-235b-a22b-instruct \
  --sampling-mode random --sampling-size 10

Prompt Templates with OpenRouter

As with the Ollama backend, you can select a different prompt template:

uv run iconclass-classification classify-openrouter \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --prompt-template instruction \
  --sampling-mode random --sampling-size 10

Advanced Options (Ollama)

uv run iconclass-classification classify-ollama \
  --source https://forschung.stadtgeschichtebasel.ch/assets/data/metadata.json \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --ollama-url http://localhost:11434 \
  --prompt-template instruction \
  --sampling-mode random \
  --sampling-size 100 \
  --sampling-seed 42 \
  --max-side 1024 \
  --quality 92 \
  --top-k 5 \
  --output runs \
  --temperature 0.0 \
  --num-ctx 4096 \
  --num-predict 128

Command-Line Options

Ollama Backend (classify-ollama)

Option               Default                                        Description
--source             required                                       URL to metadata.json
--model              hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M   Ollama model name
--ollama-url         http://localhost:11434                         Ollama service URL
--prompt-template    default                                        Prompt template (default, instruction, few_shot)
--sampling-mode      full                                           Sampling mode (random, fixed, full)
--sampling-size      None                                           Number of objects to sample (random mode)
--sampling-seed      42                                             Random seed for reproducibility
--fixed-ids-file     None                                           File with object IDs (fixed mode)
--max-side           1024                                           Maximum image side length in pixels
--quality            92                                             JPEG quality (1-100)
--top-k              None                                           Maximum number of codes per image
--output             runs                                           Base output directory
--temperature        0.0                                            Model temperature
--num-ctx            4096                                           Context window size
--num-predict        128                                            Maximum tokens to predict

OpenRouter Backend (classify-openrouter)

Option               Default                             Description
--source             required                            URL to metadata.json
--api-key            required (or OPENROUTER_API_KEY)    OpenRouter API key
--model              qwen/qwen3-vl-235b-a22b-instruct    OpenRouter model name
--prompt-template    default                             Prompt template (default, instruction, few_shot)
--sampling-mode      full                                Sampling mode (random, fixed, full)
--sampling-size      None                                Number of objects to sample (random mode)
--sampling-seed      42                                  Random seed for reproducibility
--fixed-ids-file     None                                File with object IDs (fixed mode)
--max-side           2048                                Maximum image side length in pixels
--quality            92                                  JPEG quality (1-100)
--top-k              None                                Maximum number of codes per image
--output             runs                                Base output directory
--temperature        0.0                                 Model temperature
--num-predict        128                                 Maximum tokens to predict

Output Structure

Each run creates a timestamped directory with the following structure:

runs/<UTC-ISO8601>/
  raw/
    metadata.json              # Original metadata
  data/
    src/                       # Cached raw images (by SHA256)
    <objectid>.jpg             # Processed images
  classify/
    <objectid>_request.json    # Classification requests
    <objectid>_response.json   # Classification responses
  logs/
    pipeline.log               # Detailed logs
  results/
    metadata.classified.json   # Metadata with subject codes
    iconclass_details.jsonl    # Detailed classification records
  manifest.json                # Run metadata

Output Formats

Compact Metadata (metadata.classified.json)

Each object gains a flat subject array with Iconclass codes:

{
  "objectid": "m10039",
  "title": "Die Löblich und wyt berümpt Stat Basel",
  "subject": ["71H7131", "25F2"]
}

Detailed Classification (iconclass_details.jsonl)

One JSON record per line with complete metadata:

{
  "objectid": "abb10039",
  "subject": {
    "iconclass": {
      "codes": ["71H7131", "25F2"],
      "top_k": [
        { "code": "71H7131", "rank": 1 },
        { "code": "25F2", "rank": 2 }
      ],
      "model": "hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M",
      "prompt": "Generate Iconclass labels for this image",
      "temperature": 0.0,
      "num_ctx": 4096,
      "num_predict": 128,
      "raw_text": "<model output>",
      "image_sha256": "<sha256>",
      "image_source": "<URL>",
      "processed_image_path": "data/abb10039.jpg",
      "timestamp": "2025-11-01T10:00:00Z"
    }
  }
}
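
A short Python sketch for consuming these records after a run (the run directory name below is a placeholder for the actual UTC timestamp):

import json
from pathlib import Path

run_dir = Path("runs") / "<UTC-ISO8601>"  # replace with a real run directory
details = run_dir / "results" / "iconclass_details.jsonl"

for line in details.read_text(encoding="utf-8").splitlines():
    record = json.loads(line)  # one complete JSON object per line
    print(record["objectid"], record["subject"]["iconclass"]["codes"])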

Development

Running Tests (uv)

# Ensure dev dependencies installed
uv sync --group dev

# Run tests
uv run pytest test/ -v

Code Quality

# Format code
uv run ruff format .

# Check code
uv run ruff check .

# Auto-fix issues
uv run ruff check --fix .

# (Optional) Type check
uv run ty check

Project Structure

src/iconclass_classification/
  __init__.py           # Package initialization
  __main__.py           # Entry point
  cli.py                # Command-line interface
  models.py             # Pydantic data models
  image_utils.py        # Image processing utilities
  ollama_client.py      # Ollama API client
  pipeline.py           # Main pipeline orchestration

test/
  unit/                 # Unit tests
    test_image_utils.py

Architecture

The pipeline consists of several modular components:

  1. Data Models (models.py): Pydantic models for type-safe data handling
  2. Image Processing (image_utils.py): Download, resize, normalize, and cache images
  3. Ollama Client (ollama_client.py): Interface with the Ollama API for classification (see the sketch after this list)
  4. Pipeline (pipeline.py): Orchestrates the entire workflow
  5. CLI (cli.py): User-friendly command-line interface
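
As a rough sketch of the Ollama call (item 3 above), assuming Ollama's standard /api/generate endpoint; the project's real client adds retry logic, logging, and request/response persistence:

import base64
from pathlib import Path

import requests

def classify_image(image_path: Path, model: str, prompt: str,
                   ollama_url: str = "http://localhost:11434") -> str:
    """Send one processed image to a local Ollama instance and return raw text."""
    image_b64 = base64.b64encode(image_path.read_bytes()).decode("ascii")
    response = requests.post(
        f"{ollama_url}/api/generate",
        json={
            "model": model,
            "prompt": prompt,
            "images": [image_b64],  # Ollama accepts base64-encoded images
            "stream": False,
            "options": {"temperature": 0.0, "num_ctx": 4096, "num_predict": 128},
        },
        timeout=300,
    )
    response.raise_for_status()
    return response.json()["response"]  # raw model output, parsed for codes later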

How It Works

  1. Fetch Metadata: Download metadata.json from the source URL
  2. Select Images: Choose best image URL (prefer object_location, fallback to object_thumb)
  3. Download & Cache: Download images with SHA256-based deduplication
  4. Process Images: Resize to max_side, convert to RGB, compress as JPEG (steps 3 and 4 are sketched after this list)
  5. Classify: Send processed images to Ollama Iconclass VLM
  6. Extract Codes: Parse Iconclass codes from model responses
  7. Write Results: Update metadata with codes, save detailed records
  8. Log Everything: Save requests, responses, and run manifest
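
A minimal sketch of steps 3 and 4 (content-addressed caching, then resizing and re-encoding); the real logic lives in image_utils.py and may differ in detail:

import hashlib
from io import BytesIO
from pathlib import Path

from PIL import Image  # Pillow

def cache_raw(content: bytes, cache_dir: Path) -> Path:
    """Step 3: store bytes under their SHA256 digest so re-downloads deduplicate."""
    digest = hashlib.sha256(content).hexdigest()
    path = cache_dir / digest
    if not path.exists():  # identical content maps to the same path
        cache_dir.mkdir(parents=True, exist_ok=True)
        path.write_bytes(content)
    return path

def process(raw: bytes, out_path: Path, max_side: int = 1024, quality: int = 92) -> None:
    """Step 4: cap the longest side at max_side, force RGB, save as JPEG."""
    img = Image.open(BytesIO(raw)).convert("RGB")
    img.thumbnail((max_side, max_side))  # preserves aspect ratio, never upscales
    img.save(out_path, "JPEG", quality=quality)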

Security & Ethics

  • ✅ Only HTTPS downloads supported
  • ✅ No storage of base64-encoded images on disk
  • ✅ Respects image licensing information
  • ✅ Classifications treated as metadata (recommended CC0 publication)

Performance

  • Caching: Images deduplicated by SHA256 hash
  • Concurrency: Sequential processing (parallel processing can be added)
  • Resumability: Each run is independent; failed runs can be retried

Troubleshooting

For detailed troubleshooting, see training-and-prompting.md.

Empty Classifications

If objects are returning no Iconclass codes:

  1. Check the logs: runs/<timestamp>/logs/pipeline.log contains debug information
  2. Review responses: Check classify/<objectid>_response.json for model output
  3. Try different prompts: Use --prompt-template instruction or few_shot
  4. Inspect images: Check data/<objectid>.jpg for quality issues

Ollama Not Running

Ensure Ollama is running:

ollama serve

Model Not Found

Pull the Iconclass VLM model:

ollama pull hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M

Memory Issues

Reduce image size or batch size:

uv run iconclass-classification classify-ollama \
  --source <URL> \
  --model hf.co/mradermacher/iconclass-vlm-GGUF:Q4_K_M \
  --max-side 512 \
  --sampling-mode random --sampling-size 10

Contributing

See CONTRIBUTING.md for development guidelines.

License

Citation

If you use this project in your research, please cite:

@software{iconclass_classification,
  title = {Iconclass Classification Pipeline},
  author = {Stadt Geschichte Basel},
  year = {2025},
  url = {https://github.com/Stadt-Geschichte-Basel/iconclass-classification}
}

Support

For questions, issues, or contributions, please open an issue or pull request in the GitHub repository.

Acknowledgments

This project uses:

  • Ollama for local model serving
  • OpenRouter for cloud-based VLM access
  • Pydantic for data models and validation
  • uv, ruff, and pytest for development tooling
  • The Iconclass classification system for subject notation
