LineageOS Glimpse OCR & QR Code (?) Proposal

July 23, 2024

Inspired by Apple OCR for photos.

Goals

Users can double-tap an image to select text from it.

Requirements

Perform OCR accurately and quickly
Provide points and bounding boxes for the recognized text
Enable users to touch and hold (“hold click”) to select text
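
To make the "points and bounding boxes" requirement concrete, here is a minimal, language-neutral sketch in Python of what an OCR result could look like and how the text nearest to a tap could be found. The names TextBox, distance_to, and nearest_box are hypothetical, and axis-aligned boxes in image pixel coordinates are an assumption; a real OCR engine may return rotated quads instead.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class TextBox:
    """One recognized piece of text with an axis-aligned bounding box (hypothetical shape)."""
    text: str
    left: float
    top: float
    right: float
    bottom: float

    def distance_to(self, point: Point) -> float:
        """Euclidean distance from the point to the box; 0.0 if the point is inside."""
        x, y = point
        dx = max(self.left - x, 0.0, x - self.right)
        dy = max(self.top - y, 0.0, y - self.bottom)
        return (dx * dx + dy * dy) ** 0.5


def nearest_box(boxes: List[TextBox], tap: Point) -> Optional[TextBox]:
    """Return the box closest to the tap, or None when OCR found no text."""
    if not boxes:
        return None
    return min(boxes, key=lambda box: box.distance_to(tap))


# Usage: a tap that lands between two words, closer to the second one.
boxes = [TextBox("Hello", 10, 10, 60, 30), TextBox("world", 70, 10, 120, 30)]
print(nearest_box(boxes, (68, 20)).text)  # world
```

Measuring distance to the box rather than to its center means a tap anywhere inside a long line of text still counts as a direct hit.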

Guidelines

Avoid adding many new dependencies
Should not impact the build process
Be careful with large dependencies
No Google Services

UX

The user double-taps an image to attempt text selection. If no model is present, the user is prompted to install one. If a model is present, OCR is performed on the image. If OCR succeeds, the view enters “OCR select mode” and the image is darkened. The text sequence nearest to where the user double-tapped is highlighted. The user can drag a finger across the screen to select more text. The user can then either press the close button to exit select mode, or press the check mark button to confirm the selection, which then copies the text to the clipboard.
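
The flow above can be read as a small state machine. The following is a rough sketch in Python, not a proposed implementation: the class and method names (OcrSelectFlow, double_tap, drag_to, finish_install, and so on) are hypothetical, the clipboard is a plain attribute standing in for the system clipboard, and the selection is simplified to a contiguous range of word indices. The behavior on failed OCR (stay in the normal view) is an assumption, since the flow does not specify it.

```python
from enum import Enum, auto
from typing import List, Optional


class Mode(Enum):
    VIEWING = auto()         # normal image view
    INSTALL_PROMPT = auto()  # user is asked to install an OCR model
    OCR_SELECT = auto()      # image darkened, text selectable


class OcrSelectFlow:
    def __init__(self, model_installed: bool) -> None:
        self.model_installed = model_installed
        self.mode = Mode.VIEWING
        self.words: List[str] = []
        self.anchor = 0  # index of the word nearest the double tap
        self.focus = 0   # index of the word currently under the finger
        self.clipboard: Optional[str] = None  # stand-in for the system clipboard

    def double_tap(self, ocr_words: Optional[List[str]], nearest: int = 0) -> None:
        """ocr_words is the OCR output, or None/empty if OCR failed."""
        if self.mode is not Mode.VIEWING:
            return
        if not self.model_installed:
            self.mode = Mode.INSTALL_PROMPT
            return
        if not ocr_words:
            return  # assumption: failed OCR leaves the viewer unchanged
        self.words = list(ocr_words)
        self.anchor = self.focus = max(0, min(nearest, len(self.words) - 1))
        self.mode = Mode.OCR_SELECT

    def finish_install(self, installed: bool) -> None:
        """Leave the install prompt, recording whether a model is now present."""
        if self.mode is Mode.INSTALL_PROMPT:
            self.model_installed = installed
            self.mode = Mode.VIEWING

    def drag_to(self, index: int) -> None:
        """Extend the selection from the anchor word to the word under the finger."""
        if self.mode is Mode.OCR_SELECT:
            self.focus = max(0, min(index, len(self.words) - 1))

    def selected_text(self) -> str:
        lo, hi = sorted((self.anchor, self.focus))
        return " ".join(self.words[lo:hi + 1])

    def close(self) -> None:
        """Close button: exit select mode without copying."""
        if self.mode is Mode.OCR_SELECT:
            self.mode = Mode.VIEWING

    def confirm(self) -> None:
        """Check mark button: copy the selection, then exit select mode."""
        if self.mode is Mode.OCR_SELECT:
            self.clipboard = self.selected_text()
            self.mode = Mode.VIEWING


# Usage: double tap near "Google", drag to "Services", confirm.
flow = OcrSelectFlow(model_installed=True)
flow.double_tap(["No", "Google", "Services"], nearest=1)
flow.drag_to(2)
flow.confirm()
print(flow.clipboard)  # Google Services
```

Keeping the mode explicit makes it easy to ignore gestures that do not apply, for example a drag while the install prompt is showing.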