Vision

Image, document and video data for African contexts and scripts. These tasks share the same capture, preprocessing and annotation groundwork.

Shared across image & video data

Sourcing & rights for images and video
Capture / scan standards (resolution, format, lighting)
Preprocessing (deskew, denoise, normalization, encoding)
Annotation (labels, bounding boxes, segmentation, temporal spans)
Annotation tools & inter-annotator agreement
Metadata, dataset formatting & splits
Licensing & likeness / consent

Tasks

Image data (classification, object detection, segmentation)
OCR & document AI (printed & handwritten text, layout, document understanding)
Sign language, gesture & video (recognition, glossing, sign-language translation)
