Port and/or rewrite HardsubX to Rust
HardsubX is a burned-in subtitle extractation subsystem for CCExtractor. It uses FFmpeg to parse the video frames, followed by an OCR recognition using Tesseract to detect the burned in subtiltes.
Your job is to port and/or rewrite the HardsubX subsystem to Rust, while also fixing existing bugs, improving the documentation and code quality. This is a high value task and we'd love to have it done, but in order to qualify you need to fix some of the existing bugs.
- Abhinav's Blog on the original implementation of HardsubX
- Extract hard-coded subtitles from video streams
- HardsubX docs
Related GitHub Issues:-
Other Qualification tasks
Take a look at this page.