github OCR-D/ocrd_all v2025-04-11

latest release: v2025-04-17
13 days ago

Changed:

  • Disable ocrd_detectron2 and ocrd_segment for OSX, #460

cor-asv-ann eb47a54..bc6a590

Release: v0.2.0

  • CD PyPI: use latest packaging to fix metadata issue with twine
  • fix license string to conform to PyPI req
  • CD PyPI: use Pythonic base image
  • docker: add vars
  • Partial revert of "CI: install hunspell for ocrd-cor-asv-ann-mark test"…
  • CI: differentiate store_test_results / store_artifacts
  • requirements_test: fix typo (xdist instead of isolate)
  • CI: reduce workload further, isolate tests to free memory
  • CI: install hunspell for ocrd-cor-asv-ann-mark test
  • tests: reduce workload of combinations to fit into 1h without GPU
  • CI: cache model download, increase timeout
  • CI: forgot to prefix model download directory
  • CI: forgot to stage actual removal of Orb
  • 📦 0.2.0
  • CI: switch from Orb to CLI for Codecov
  • CI: change codecov orb version
  • setup.py → pyproject.toml (w/ dynamic setuptools-ocrd versioning)
  • ocrd-tool.json: add 2 model files as examples
  • add coverage, add tests to CI and pypi to CD
  • docker: update+improve…
  • increase default batch size for better GPU utilisation
  • autosized generators: make work with Keras 2.3.x (stateful metrics)
  • train: add model checkpointing callback
  • seq2seq: be robust if EOL is missing in binary (prob/confmat) line input
  • seq2seq: cover non-string line input (prob/confmat) in fallback
  • transcode: simplify updating sequence, improve logging…
  • transcode: pass all input TextEquiv alternatives…
  • fix charmap kwarg init
  • transcode test: abort on failure
  • add test for transcode only, cover more fileGrps and parameters, check and compare confidences
  • transcode: also remove lower hierarchy levels for consistency
  • transcode: utilise .parent_object_, simplify
  • tests: eval all against 'MULTI', test 'mark' separately, assert equal CER for both 'eval' strategies
  • replace editdistance with rapidfuzz
  • add tests
  • migrate all processors to ocrd>=3.0
  • CircleCI: separate CD and CI

docstruct 004e6ec..f8c4e46

  • Merge pull request #2 from bertsky/migrate-ocrd-v3

format-converters fa8b4b5..0894801

  • Merge pull request #25 from stweil/PEP8

ocrd_anybaseocr 0c67107..0aa1d57

Release: v2.0.0

  • 📦 v2.0.0
  • 📝 changelog
  • Merge pull request #113 from bertsky/v3-api
  • ocrd-tool: don't restrict resource_locations
  • Be defensive about uninitializing possibly uninitialized model and label_mapping
  • 3.8 typing
  • self.reset at the end of process_workspace, del self.model in shutdown
  • move models to root of package
  • adapt tests
  • rewrite with process_workspace and deferred METS modification
  • rely on resolve_resource for model path
  • use contextmanager to open pickled label mapping
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • remove redundant model-exists check
  • CI: larger resource_class
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • pickle was ignored
  • add model to package data
  • reintroduce layout-analysis
  • remove constants
  • convert to src-layout
  • remove old mets files
  • remove pix2pixhd
  • fix setuptools packages
  • CI: remove references to models
  • adapt cropper and test
  • remove everything that is not cropping-related
  • convert cropper to v3 API, setup.py -> pyproject.toml

ocrd_cis db65d7f..5cf22f5

Release: v0.2.0

  • all: adapt to more Numpy and Shapely changes/deprecations
  • get rid of remaining python-levenshtein (in favour of rapidfuzz)
  • all: migrate importlib instead of EOL pkg_resources via ocrd_utils
  • all: migrate to OCR-D v3 API (including METS Server and page multiprocessing)
  • all: some code style, type hints and optimization
  • tests: simplify, update, improve, test serially as well as page-parallel
  • CI: ditch CircleCI, add Github Actions
  • ocrd-tool.json: add dockerhub=ocrd/cis
  • docker: add labels metadata conforming to spec
  • docker: preinstall shortcut ocrd-all-tool.json via ocrd-tool.json
  • setup.pypyproject.toml, versioning based on ocrd-tool.json
  • add continuous deployment for Dockerhub
  • add continuous deployment for PyPI

ocrd_detectron2 218e0b5..3e5f838

Release: v0.2.0

  • relax setuptools version requirement
  • update setuptools to fix detectron2 build
  • CI: remove Python 3.11 (does not work with detectron2)
  • Update CHANGELOG.md
  • Merge pull request #31 from bertsky/migrate-ocrd-v3

ocrd_froc 5a5d774..323a6ba

Release: v1.0.1

  • 📦 v1.0.1
  • 📝 changelog
  • Merge pull request #20 from bertsky/patch-1
  • 📦 v1.0.0
  • 📝 changelog
  • Dockerfile: COPY ocrd-tool.json to avoid symlink
  • make docker: podman does not support --progress
  • CI/GHA: actually run make test
  • CI: docker
  • Dockerfile: set locale/XDG variables, use COPY . . and .dockerignore
  • Dockerfile: base on ocrd/core-cuda-torch
  • Update requirements-test.txt
  • tests: use subtests.test instead of plain for loop
  • unit tests
  • processor: handle ocr_method=none vs ocr_method=adaptive correctly
  • OCR-D processor: restrict to 1 worker due to single CUDA context
  • Update pyproject.toml
  • Convert to OCR-D v3

ocrd_pagetopdf 00baf15..c6b414d

Release: v2.0.1

  • 📦 v2.0.1
  • 📝 changelog
  • add changelog
  • improve multipage structmap extraction (more robust, more cases)
  • tests: debug logging only for processors
  • deps-ubuntu: add apt-get update; add deps-test
  • Merge pull request #29 from OCR-D/missing-import-re

Don't miss a new ocrd_all release

NewReleases is sending notifications on new releases.