tesseract: 3.05.00 -> 4.1.1 #105089

Closed
wants to merge 1 commit into from

r-ryantm
Contributor

Automatic update generated by nixpkgs-update tools. This update was made based on information from https://github.com/tesseract-ocr/tesseract/releases.

meta.description for tesseract is: "OCR engine"

meta.homepage for tesseract is: "https://github.com/tesseract-ocr/tesseract"

meta.changelog for tesseract is: ""

Updates performed
  • Version update
Impact
Rebuild report (if merged into master)
60 total rebuild path(s)

17 package rebuild(s)

17 x86_64-linux rebuild(s)
16 i686-linux rebuild(s)
11 x86_64-darwin rebuild(s)
16 aarch64-linux rebuild(s)


First fifty rebuilds by attrpath
ccextractor
gImageReader
gscan2pdf
invoice2data
paperless
pdfsandwich
python27Packages.pytesseract
python27Packages.tesserocr
python37Packages.pytesseract
python37Packages.tesserocr
python38Packages.pytesseract
python38Packages.tesserocr
qt-box-editor
ripgrep-all
tesseract
tesseract3
vobsub2srt
Instructions to test this update

Either download from Cachix:

nix-store -r /nix/store/x1ryrb7d1l4xsh3nwbcz19hq0gsi2cq6-tesseract-4.1.1 \
  --option binary-caches 'https://cache.nixos.org/ https://nix-community.cachix.org/' \
  --option trusted-public-keys '
  nix-community.cachix.org-1:mB9FSh9qf2dCimDSUo8Zy7bkq5CX+/rkCWyvRCYg3Fs=
  cache.nixos.org-1:6NCHdD59X431o0gWypbMrAURkbJ16ZPMQFGspcDShjY=
  '

(The Cachix cache is only trusted for this store-path realization.)
For the Cachix download to work, your user must be in the trusted-users list or you can use sudo since root is effectively trusted.

Or, build yourself:

nix-build -A tesseract https://github.com/r-ryantm/nixpkgs/archive/6a800561a9b9c1a69720f56d7cf4207a657304d9.tar.gz

After you've downloaded or built it, look at the files and if there are any, run the binaries:

ls -la /nix/store/x1ryrb7d1l4xsh3nwbcz19hq0gsi2cq6-tesseract-4.1.1
ls -la /nix/store/x1ryrb7d1l4xsh3nwbcz19hq0gsi2cq6-tesseract-4.1.1/bin
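
As a minimal sketch of the "run the binaries" step, the helper below checks that a store path contains an executable bin/tesseract and asks it for its version. The function name check_tesseract is hypothetical and not part of nixpkgs-update; it assumes a POSIX shell and relies only on tesseract's --version flag.

```shell
# Hypothetical helper (not part of nixpkgs-update): given a store path,
# run its bin/tesseract with --version, or fail if no executable is there.
check_tesseract() {
  store_path="$1"
  bin="$store_path/bin/tesseract"
  if [ -x "$bin" ]; then
    "$bin" --version
  else
    echo "no executable found at $bin" >&2
    return 1
  fi
}
```

It could then be called as check_tesseract /nix/store/x1ryrb7d1l4xsh3nwbcz19hq0gsi2cq6-tesseract-4.1.1 once that path has been downloaded or built.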


Pre-merge build results

We have automatically built all packages that will get rebuilt due to
this change.

This gives evidence on whether the upgrade will break dependent packages.
Note that packages sometimes show up as failed to build independently of the
change, simply because they are already broken on the target branch.

Result of nixpkgs-review 1

5 packages failed to build:
  • gscan2pdf
  • python27Packages.tesserocr
  • python37Packages.tesserocr
  • python38Packages.tesserocr
  • vobsub2srt
10 packages built:
  • ccextractor
  • gImageReader
  • paperless
  • pdfsandwich
  • python27Packages.pytesseract
  • python37Packages.pytesseract
  • python38Packages.pytesseract
  • qt-box-editor
  • ripgrep-all
  • tesseract
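
The rebuild report lists 17 attrpaths, while the review result accounts for 15 (5 failed, 10 built). A rough sketch for finding the ones that do not appear in the review result, using only the names listed in this PR; the helper not_in is hypothetical and assumes a POSIX shell:

```shell
# Attrpaths copied from the rebuild report and the nixpkgs-review result above.
rebuilds="ccextractor gImageReader gscan2pdf invoice2data paperless pdfsandwich
python27Packages.pytesseract python27Packages.tesserocr
python37Packages.pytesseract python37Packages.tesserocr
python38Packages.pytesseract python38Packages.tesserocr
qt-box-editor ripgrep-all tesseract tesseract3 vobsub2srt"

failed="gscan2pdf python27Packages.tesserocr python37Packages.tesserocr
python38Packages.tesserocr vobsub2srt"

built="ccextractor gImageReader paperless pdfsandwich
python27Packages.pytesseract python37Packages.pytesseract
python38Packages.pytesseract qt-box-editor ripgrep-all tesseract"

# Hypothetical helper: print every word of $1 that does not occur in $2.
not_in() {
  # Collapse newlines to single spaces and pad with spaces for whole-word matching.
  haystack=" $(echo $2) "
  for pkg in $1; do
    case "$haystack" in
      *" $pkg "*) ;;
      *) echo "$pkg" ;;
    esac
  done
}

unreviewed=$(not_in "$rebuilds" "$failed $built")
echo "$unreviewed"
```

With the lists above this prints invoice2data and tesseract3, the two rebuilds that the review result does not mention.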

Maintainer pings

cc @viric @erikarvstedt for testing.

@SuperSandro2000
Member

I think we might want to retain 3.05.00 here.

@xaverdh
Contributor

xaverdh commented Nov 27, 2020

Well, apparently they did implement a fallback mode to the old engine.

@Ma27
Member

Ma27 commented Nov 27, 2020

It doesn't really matter how v4 works; the change is bogus anyway: tesseract v4 is available as tesseract4 in nixpkgs. If the maintainers feel that it should become the new default, that's fine, but this PR only adds a second derivation for tesseract4.

cc @ryantm I guess that this case should be supported by r-ryantm :)

@Ma27 Ma27 closed this Nov 27, 2020
@r-ryantm r-ryantm deleted the auto-update/tesseract branch November 29, 2020 22:27