TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Paper
•
2109.10282
•
Published
•
11
library_name: transformers tags:
This is a finetuned version of TROCR Large specialized in handwritten text recognition for Arabic and English languages.
This is a finetuned version of Microsoft's TROCR Large model, adapted for handwritten text recognition in Arabic and English languages using the Khatt and IAM Handwriting datasets.
This model is intended for handwritten text recognition in Arabic and English documents.
The model should not be used for:
Users should be aware that the model is specifically trained for Arabic and English handwritten text and may not perform well on other languages or printed text.
# Sample code to load the model
from transformers import TrOCRProcessor, VisionEncoderDecoderModel
processor = TrOCRProcessor.from_pretrained("David-Magdy/TR_OCR_LARGE")
model = VisionEncoderDecoderModel.from_pretrained("David-Magdy/TR_OCR_LARGE")
Base model
microsoft/trocr-large-handwritten