DIFFUSIONSTR: DIFFUSION MODEL FOR SCENE TEXT RECOGNITION

Masato Fujitake

SPS

Lecture 09 Oct 2023

This paper presents Diffusion Model for Scene Text Recognition (DiffusionSTR), an end-to-end framework that uses a diffusion model to recognize text in the wild. Whereas existing studies treat scene text recognition as an image-to-text transformation, we reframe it as a text-to-text transformation conditioned on the image within a diffusion model. We show for the first time that a diffusion model can be applied to text recognition. Furthermore, experimental results on publicly available datasets show that the proposed method achieves accuracy competitive with state-of-the-art methods.

Tags:

Scene Text Recognition

Document analysis

diffusion model

deep learning

machine learning
