Inline math detection
- Add new inline math detection model and benchmark
Textract OCR benchmark
Benchmark surya against textract as well as google cloud vision. For just english, results look like:
| Model | Time per page (s) | Avg Score | English |
|---|---|---|---|
| surya | 0.522628 | 0.983298 | 0.983298 |
| textract | 1.44293 | 0.947458 | 0.947458 |
XLA support
Add support for TPUs. Still fairly slow, but lots of optimizations to be made.
Minor speedups
Refactor inference to get a 5-10% speed boost across all models.
What's Changed
- Add XLA support by @iammosespaulr in #298
- Add Inline Math Detection by @tarun-menta in #297
- Update to new line detection model by @tarun-menta in #305
- Fix merging of inline boxes by drawing textlines in heatmap by @tarun-menta in #309
- XLA improvements by @VikParuchuri in #306
- Update inline math checkpoint by @VikParuchuri in #310
- Misc Line Detection Fixes by @tarun-menta in #313
- Add Textract OCR Benchmark by @tarun-menta in #307
- Inline math model, new text detection model by @VikParuchuri in #312
Full Changelog: v0.10.3...v0.11.0