Skip to content

Dataset

This page describes the expected format for datasets used when training models with fast-plate-ocr.


Expected File Format

Your dataset should be provided as a CSV file with the following structure:

Column Name Type Description
image_path str Relative path to the license plate image
plate_text str Ground-truth text on the plate (no padding)

Relative Paths

Image paths in the CSV are resolved relative to the location of the CSV file, not the working directory.


Dataset Structure Example

The dataset should include at least a CSV file (for training and validation) and a folder with corresponding plate images.

dataset/
├── train/
│   ├── annotations.csv
│   └── images/
│       ├── 00001.jpg
│       ├── 00002.jpg
│       ├── 00003.jpg
│       ├── 00004.jpg
│       └── 00005.jpg
└── val/
    ├── annotations.csv
    └── images/
        ├── 00006.jpg
        └── 00007.jpg
train/annotations.csv
image_path,plate_text
images/00001.jpg,KNN505
images/00002.jpg,J00NCW
images/00003.jpg,48593
images/00004.jpg,AB123CD
images/00005.jpg,17AB
val/annotations.csv
image_path,plate_text
images/00006.jpg,NFM374
images/00007.jpg,ZXC9871