
LicensePlateRecognizer

LicensePlateRecognizer is the main ONNX Runtime inference entry point. It accepts file paths, NumPy arrays, lists of images, and pre-batched arrays, and returns a list of PlatePrediction objects.

When passing NumPy arrays, they should already match the model configuration: uint8, channels_last, and the expected color mode (grayscale or RGB). For RGB models, arrays are assumed to be RGB, not OpenCV-style BGR.

from fast_plate_ocr import LicensePlateRecognizer

recognizer = LicensePlateRecognizer("cct-s-v2-global-model")
predictions = recognizer.run("test_plate.png")
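Since RGB models assume RGB input, frames read with OpenCV (which returns BGR) must have their channel order reversed first. A minimal sketch of that conversion using plain NumPy (the synthetic `bgr` array stands in for a `cv2.imread` result):

```python
import numpy as np

# A BGR frame such as cv2.imread returns (values here are synthetic).
bgr = np.zeros((32, 64, 3), dtype=np.uint8)
bgr[..., 0] = 255  # blue channel in BGR order

# Reverse the channel axis to get RGB before handing the array to an RGB model.
rgb = np.ascontiguousarray(bgr[..., ::-1])

# recognizer.run(rgb) would now see the expected RGB, channels_last, uint8 layout.
```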

ONNX Runtime inference class for license plate recognition.

The current OCR models available from the HUB are:

  • cct-s-v2-global-model: OCR model trained with global plates data. Based on Compact Convolutional Transformer (CCT) architecture. This is the recommended S default.
  • cct-xs-v2-global-model: OCR model trained with global plates data. Based on Compact Convolutional Transformer (CCT) architecture. This is the XS variant of the v2 family.
  • cct-s-v1-global-model: OCR model trained with global plates data. Based on Compact Convolutional Transformer (CCT) architecture. This is the S variant.
  • cct-xs-v1-global-model: OCR model trained with global plates data. Based on Compact Convolutional Transformer (CCT) architecture. This is the XS variant.
  • argentinian-plates-cnn-model: OCR for Argentinian license plates. Uses a fully convolutional architecture.
  • argentinian-plates-cnn-synth-model: OCR for Argentinian license plates trained with synthetic and real data. Uses a fully convolutional architecture.
  • european-plates-mobile-vit-v2-model: OCR for European license plates. Uses MobileViT-2 as the backbone.
  • global-plates-mobile-vit-v2-model: OCR for global license plates (65+ countries). Uses MobileViT-2 as the backbone.

Parameters:

  • hub_ocr_model (OcrModel | None, default None): Name of the OCR model to use from the HUB.
  • device (Literal['cuda', 'cpu', 'auto'], default 'auto'): Device type for inference. Should be one of ('cpu', 'cuda', 'auto'). In 'auto' mode, the device is deduced from onnxruntime.get_available_providers().
  • providers (Sequence[str | tuple[str, dict]] | None, default None): Optional sequence of providers in order of decreasing precedence. If not specified, all available providers are used based on the device argument.
  • sess_options (SessionOptions | None, default None): Advanced session options for ONNX Runtime.
  • onnx_model_path (PathLike | None, default None): Path to a custom ONNX model file.
  • plate_config_path (PathLike | None, default None): Path to a custom plate config file.
  • force_download (bool, default False): Force re-downloading the model, even if it is already cached locally.

Returns: None.
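The interaction between device and providers can be sketched as a standalone function. This mirrors the constructor's provider-resolution branch shown in the source below; it is an illustrative sketch, not part of the library's API:

```python
def resolve_providers(device: str, available: list[str]) -> list[str]:
    """Map a device string to an ONNX Runtime provider list (illustrative sketch)."""
    if device == "cuda":
        return ["CUDAExecutionProvider"]
    if device == "cpu":
        return ["CPUExecutionProvider"]
    if device == "auto":
        # 'auto' defers to whatever providers the installed runtime exposes,
        # as onnxruntime.get_available_providers() would report.
        return list(available)
    raise ValueError(f"Device should be one of ('cpu', 'cuda', 'auto'). Got '{device}'.")
```

Note that an explicit providers argument bypasses this logic entirely: the sequence is passed to ONNX Runtime as-is, in order of decreasing precedence.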

Source code in fast_plate_ocr/inference/plate_recognizer.py
def __init__(
    self,
    hub_ocr_model: OcrModel | None = None,
    device: Literal["cuda", "cpu", "auto"] = "auto",
    providers: Sequence[str | tuple[str, dict]] | None = None,
    sess_options: ort.SessionOptions | None = None,
    onnx_model_path: PathLike | None = None,
    plate_config_path: PathLike | None = None,
    force_download: bool = False,
) -> None:
    """
    Initializes the `LicensePlateRecognizer` with the specified model and inference device.

    The current OCR models available from the HUB are:

    - `cct-s-v2-global-model`: OCR model trained with **global** plates data. Based on Compact
        Convolutional Transformer (CCT) architecture. This is the recommended **S** default.
    - `cct-xs-v2-global-model`: OCR model trained with **global** plates data. Based on Compact
        Convolutional Transformer (CCT) architecture. This is the **XS** variant of the v2
        family.
    - `cct-s-v1-global-model`: OCR model trained with **global** plates data. Based on Compact
        Convolutional Transformer (CCT) architecture. This is the **S** variant.
    - `cct-xs-v1-global-model`: OCR model trained with **global** plates data. Based on Compact
        Convolutional Transformer (CCT) architecture. This is the **XS** variant.
    - `argentinian-plates-cnn-model`: OCR for **Argentinian** license plates. Uses fully conv
        architecture.
    - `argentinian-plates-cnn-synth-model`: OCR for **Argentinian** license plates trained with
        synthetic and real data. Uses fully conv architecture.
    - `european-plates-mobile-vit-v2-model`: OCR for **European** license plates. Uses
        MobileVIT-2 for the backbone.
    - `global-plates-mobile-vit-v2-model`: OCR for **global** license plates (+65 countries).
        Uses MobileVIT-2 for the backbone.

    Args:
        hub_ocr_model: Name of the OCR model to use from the HUB.
        device: Device type for inference. Should be one of ('cpu', 'cuda', 'auto'). If
            'auto' mode, the device will be deduced from
            `onnxruntime.get_available_providers()`.
        providers: Optional sequence of providers in order of decreasing precedence. If not
            specified, all available providers are used based on the device argument.
        sess_options: Advanced session options for ONNX Runtime.
        onnx_model_path: Path to ONNX model file to use (In case you want to use a custom one).
        plate_config_path: Path to config file to use (In case you want to use a custom one).
        force_download: Force and download the model, even if it already exists.
    Returns:
        None.
    """
    self.logger = logging.getLogger(__name__)

    if providers is not None:
        self.providers = providers
        self.logger.info("Using custom providers: %s", providers)
    else:
        if device == "cuda":
            self.providers = ["CUDAExecutionProvider"]
        elif device == "cpu":
            self.providers = ["CPUExecutionProvider"]
        elif device == "auto":
            self.providers = ort.get_available_providers()
        else:
            raise ValueError(f"Device should be one of ('cpu', 'cuda', 'auto'). Got '{device}'.")

        self.logger.info("Using device '%s' with providers: %s", device, self.providers)

    if onnx_model_path and plate_config_path:
        onnx_model_path = pathlib.Path(onnx_model_path)
        plate_config_path = pathlib.Path(plate_config_path)
        if not onnx_model_path.exists() or not plate_config_path.exists():
            raise FileNotFoundError("Missing model/config file!")
        self.model_name = onnx_model_path.stem
    elif hub_ocr_model:
        self.model_name = hub_ocr_model
        onnx_model_path, plate_config_path = hub.download_model(
            model_name=hub_ocr_model, force_download=force_download
        )
    else:
        raise ValueError("Either provide a model from the HUB or a custom model_path and config_path")

    self.config = PlateConfig.from_yaml(plate_config_path)
    self.model = ort.InferenceSession(onnx_model_path, providers=self.providers, sess_options=sess_options)
    self.output_names = [output.name for output in self.model.get_outputs()]
    self.plate_output_name = "plate" if "plate" in self.output_names else self.output_names[0]
    self.region_output_name = "region" if "region" in self.output_names else None
    self.has_region_head = self.region_output_name is not None and self.config.has_region_recognition

    if self.region_output_name is None and self.config.has_region_recognition:
        self.logger.warning(
            "Plate config declares regions but the loaded model has no 'region' output. "
            "Region predictions will be disabled."
        )
    if self.region_output_name is not None and not self.config.has_region_recognition:
        self.logger.warning(
            "Model exports a 'region' output but the plate config has no region list. "
            "Region predictions will be disabled."
        )
    self._fetch_outputs = (
        [self.plate_output_name] if not self.has_region_head else [self.plate_output_name, self.region_output_name]
    )

benchmark

benchmark(
    n_iter: int = 2500,
    batch_size: int = 1,
    include_processing: bool = False,
    warmup: int = 250,
) -> None

Run an inference benchmark and pretty print the results.

It reports the following metrics:

  • Average latency per batch (milliseconds)
  • Throughput in plates / second (PPS), i.e., how many plates the pipeline can process per second at the chosen batch_size.
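The two metrics are related by the batch size: throughput is derived from average batch latency as PPS = (1000 / avg_latency_ms) * batch_size. A worked example with hypothetical numbers:

```python
# Hypothetical benchmark figures, to illustrate the PPS formula only.
avg_latency_ms = 4.0  # average time per batch, in milliseconds
batch_size = 8

# 1000 / 4.0 = 250 batches per second; 250 * 8 = 2000 plates per second.
pps = (1_000 / avg_latency_ms) * batch_size
```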

Parameters:

  • n_iter (int, default 2500): The number of iterations to run the benchmark. This determines how many times the inference will be executed to compute the average performance metrics.
  • batch_size (int, default 1): Batch size to use for the benchmark.
  • include_processing (bool, default False): Whether the benchmark should include preprocessing and postprocessing times in the measurement.
  • warmup (int, default 250): Number of warmup iterations to run before the benchmark.
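The warmup-then-timed-loop pattern used by benchmark() (visible in the source below) generalizes to any workload: run the function a few times to warm caches and lazy initialization, then average over the timed iterations. A self-contained sketch with a stand-in workload instead of model inference:

```python
import time

def time_workload(fn, n_iter: int = 100, warmup: int = 10) -> float:
    """Return the average per-call latency of fn in milliseconds (illustrative sketch)."""
    for _ in range(warmup):
        fn()  # warm caches and lazy initialization; not timed
    cum_ms = 0.0
    for _ in range(n_iter):
        start = time.perf_counter()
        fn()
        cum_ms += (time.perf_counter() - start) * 1_000
    return cum_ms / n_iter if n_iter else 0.0

# Stand-in workload; benchmark() times self.model.run(...) or self.run(...) instead.
avg_ms = time_workload(lambda: sum(range(1_000)))
```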
Source code in fast_plate_ocr/inference/plate_recognizer.py
def benchmark(
    self,
    n_iter: int = 2_500,
    batch_size: int = 1,
    include_processing: bool = False,
    warmup: int = 250,
) -> None:
    """
    Run an inference benchmark and pretty print the results.

    It reports the following metrics:

    * **Average latency per batch** (milliseconds)
    * **Throughput** in *plates / second* (PPS), i.e., how many plates the pipeline can process
      per second at the chosen ``batch_size``.

    Args:
        n_iter: The number of iterations to run the benchmark. This determines how many times
            the inference will be executed to compute the average performance metrics.
        batch_size: Batch size to use for the benchmark.
        include_processing: Indicates whether the benchmark should include preprocessing and
            postprocessing times in the measurement.
        warmup: Number of warmup iterations to run before the benchmark.
    """
    x = np.random.randint(
        0,
        256,
        size=(
            batch_size,
            self.config.img_height,
            self.config.img_width,
            self.config.num_channels,
        ),
        dtype=np.uint8,
    )

    # Warm-up
    for _ in range(warmup):
        if include_processing:
            self.run(x)
        else:
            self.model.run(None, {"input": x})

    # Timed loop
    cum_time = 0.0
    for _ in range(n_iter):
        with measure_time() as time_taken:
            if include_processing:
                self.run(x)
            else:
                self.model.run(None, {"input": x})
        cum_time += time_taken()

    avg_time_ms = cum_time / n_iter if n_iter else 0.0
    pps = (1_000 / avg_time_ms) * batch_size if n_iter else 0.0

    console = Console()
    model_info = Panel(
        Text(f"Model: {self.model_name}\nProviders: {self.providers}", style="bold green"),
        title="Model Information",
        border_style="bright_blue",
        expand=False,
    )
    console.print(model_info)
    table = Table(title=f"Benchmark for '{self.model_name}'", border_style="bright_blue")
    table.add_column("Metric", justify="center", style="cyan", no_wrap=True)
    table.add_column("Value", justify="center", style="magenta")

    table.add_row("Batch size", str(batch_size))
    table.add_row("Warm-up iters", str(warmup))
    table.add_row("Timed iterations", str(n_iter))
    table.add_row("Average Time / batch (ms)", f"{avg_time_ms:.4f}")
    table.add_row("Plates per Second (PPS)", f"{pps:.4f}")
    console.print(table)

run

run(
    source: str | list[str] | NDArray | list[NDArray],
    return_confidence: bool = False,
    remove_pad_char: bool = True,
) -> list[PlatePrediction]

Run plate recognition on one or more images.

Parameters:

  • source (str | list[str] | NDArray | list[NDArray], required): One or more image inputs, which can be:
      ◦ A file path (str or PathLike) to an image.
      ◦ A list of file paths.
      ◦ A NumPy array for a single image with shape (H, W), (H, W, 1) or (H, W, 3).
      ◦ A list of NumPy arrays, each representing a single image.
      ◦ A 4D NumPy array of shape (N, H, W, C), already batched and ready for inference.
    Inputs are resized/converted as needed according to the loaded PlateConfig (color mode, aspect ratio handling, interpolation, and padding). For in-memory NumPy arrays, the library assumes the arrays already use the expected color mode from the config: grayscale arrays for grayscale models, and RGB arrays for RGB models. Arrays are expected in channels_last layout and are cast to uint8 before inference.
  • return_confidence (bool, default False): If True, include per-character confidence scores in each prediction (PlatePrediction.char_probs).
  • remove_pad_char (bool, default True): If True, remove trailing configured padding characters from decoded plate text.

Returns:

A list of PlatePrediction, one per input image (or batch element).

  • plate is always present.
  • char_probs is present only when return_confidence=True.
  • region is populated automatically when the loaded model exports a region output and the config provides region labels; otherwise it is None.
  • region_prob is populated only when both region prediction is available and return_confidence=True.
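For the pre-batched case, single-image arrays can be stacked into one (N, H, W, C) uint8 batch before calling run(). A minimal sketch; the dimensions below are hypothetical, since the real values come from the loaded PlateConfig:

```python
import numpy as np

# Hypothetical input size for a grayscale model (H, W, C); illustrative only.
H, W, C, N = 70, 140, 1, 4

# Stack N single-image arrays into one (N, H, W, C) uint8 batch,
# the pre-batched layout run() accepts directly.
images = [np.random.randint(0, 256, size=(H, W, C), dtype=np.uint8) for _ in range(N)]
batch = np.stack(images, axis=0)

# recognizer.run(batch) would return one PlatePrediction per batch element.
```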
Source code in fast_plate_ocr/inference/plate_recognizer.py
def run(
    self,
    source: str | list[str] | npt.NDArray | list[npt.NDArray],
    return_confidence: bool = False,
    remove_pad_char: bool = True,
) -> list[PlatePrediction]:
    """
    Run plate recognition on one or more images.

    Args:
        source: One or more image inputs, which can be:

            - A file path (``str`` or ``PathLike``) to an image.
            - A list of file paths.
            - A NumPy array for a single image with shape ``(H, W)``, ``(H, W, 1)`` or ``(H, W, 3)``.
            - A list of NumPy arrays, each representing a single image.
            - A 4D NumPy array of shape ``(N, H, W, C)`` already batched and ready for inference.

            Inputs are resized/converted as needed according to the loaded `PlateConfig` (color mode,
            aspect ratio handling, interpolation, and padding). For in-memory NumPy arrays, the library
            assumes the arrays already use the expected color mode from the config: grayscale arrays for
            grayscale models, and RGB arrays for RGB models. Arrays are expected in ``channels_last``
            layout and are cast to ``uint8`` before inference.

        return_confidence: If ``True``, include per-character confidence scores in each prediction
            (`PlatePrediction.char_probs`).
        remove_pad_char: If ``True`` (default), remove trailing configured padding characters
            from decoded plate text.

    Returns:
        A list of `PlatePrediction`, one per input image (or batch element).

        - `plate` is always present.
        - `char_probs` is present only when ``return_confidence=True``.
        - `region` is populated automatically when the loaded model exports a ``region`` output and
            the config provides region labels; otherwise it is ``None``.
        - `region_prob` is populated only when both region prediction is available and
            ``return_confidence=True``.
    """

    x = _load_image_from_source(source, self.config)
    x = preprocess_image(x)

    fetch_outputs = self._fetch_outputs if self.has_region_head else [self.plate_output_name]
    raw_outputs: list[npt.NDArray] = self.model.run(fetch_outputs, {"input": x})
    outputs = dict(zip(fetch_outputs, raw_outputs, strict=True))

    return postprocess_output(
        model_output=outputs[self.plate_output_name],
        max_plate_slots=self.config.max_plate_slots,
        model_alphabet=self.config.alphabet,
        pad_char=self.config.pad_char,
        remove_pad_char=remove_pad_char,
        return_confidence=return_confidence,
        return_region=self.has_region_head,
        region_output=(
            outputs[self.region_output_name]
            if self.has_region_head and self.region_output_name is not None
            else None
        ),
        region_labels=self.config.plate_regions if self.has_region_head else None,
    )

run_one

run_one(
    source: str | NDArray,
    return_confidence: bool = False,
    remove_pad_char: bool = True,
) -> PlatePrediction

Convenience wrapper around run() for single-image inference.

Parameters:

  • source (str | NDArray, required): A single image input (path or NumPy array).
  • return_confidence (bool, default False): Whether to include per-character confidences.
  • remove_pad_char (bool, default True): Whether to remove trailing configured padding characters from decoded plate text.

Returns:

A single PlatePrediction. Region fields are populated automatically when supported.

Raises:

  • ValueError: If the input resolves to anything other than exactly one sample.

Source code in fast_plate_ocr/inference/plate_recognizer.py
def run_one(
    self,
    source: str | npt.NDArray,
    return_confidence: bool = False,
    remove_pad_char: bool = True,
) -> PlatePrediction:
    """
    Convenience wrapper around `run()` for single-image inference.

    Args:
        source: A single image input (path or NumPy array).
        return_confidence: Whether to include per-character confidences.
        remove_pad_char: Whether to remove trailing configured padding characters from decoded
            plate text.

    Returns:
        A single `PlatePrediction`. Region fields are populated automatically when supported.

    Raises:
        ValueError: If the input resolves to anything other than exactly one sample.
    """
    results = self.run(source, return_confidence=return_confidence, remove_pad_char=remove_pad_char)
    if len(results) != 1:
        raise ValueError(f"Expected exactly 1 result, got {len(results)}.")
    return results[0]