
#1 2017-02-20 20:59:29

lvdd
Member
Registered: 2017-02-20
Posts: 1

Tesseract not recognized by gscan2pdf 1.7.2-1

Hi,

I just installed gscan2pdf 1.7.2-1 from the AUR (including all dependencies), but it doesn't recognise that tesseract is installed. At startup I get a message saying that no OCR engine is installed, although tesseract (including language packs) definitely is, and it works fine with other applications such as paperwork. Installing gocr or cuneiform solves the issue, but those OCR engines don't produce usable results for me.
Otherwise the application works fine. I just need a working OCR engine.

$ tesseract -v
tesseract 3.05.00
leptonica-1.74
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.1) : libpng 1.6.28 : libtiff 4.0.7 : zlib 1.2.11 : libwebp 0.5.2

$ which tesseract
/usr/bin/tesseract

I am not sure whether this is a problem with the AUR package or a general gscan2pdf issue that should be reported upstream. Is anybody else having a similar problem?
I have a debug log from gscan2pdf --log=xxx, but I don't see any errors in it.

INFO - Starting gscan2pdf 1.7.2
INFO - Log level DEBUG
INFO - Using en_US.UTF-8 locale
INFO - Startup LC_NUMERIC C
INFO - Reading config from /home/lvdd/.gscan2pdf
INFO - Config file version 1.7.2
DEBUG - $VAR1 = {
          'pdf compression' => 'auto',
          'window_x' => 356,
          'OCR output' => 'replace',
          'selection' => undef,
          'threshold tool' => 80,
          'scan-reload-triggers' => [
                                      'mode'
                                    ],
          'pdf font' => undef,
          'Blank threshold' => '0.005',
          'cache' => undef,
          'window_width' => 2002,
          'title-suggestions' => undef,
          'Paper' => {
                       'A4' => {
                                 'y' => 297,
                                 'l' => 0,
                                 't' => 0,
                                 'x' => 210
                               },
                       'US Letter' => {
                                        'x' => 216,
                                        't' => 0,
                                        'l' => 0,
                                        'y' => 279
                                      },
                       'US Legal' => {
                                       'x' => 216,
                                       't' => 0,
                                       'y' => 356,
                                       'l' => 0
                                     }
                     },
          'subject' => undef,
          'close_dialog_on_save' => 1,
          'window_height' => 1261,
          'version' => '1.7.2',
          'auto-open-scan-dialog' => '1',
          'device blacklist' => '',
          'default filename' => '%a %y-%m-%d',
          'device' => 'plustek:libusb:003:007',
          'frontend' => 'libsane-perl',
          'cycle sane handle' => '',
          'udt_on_scan' => '',
          'Page range' => 'all',
          'TMPDIR' => undef,
          'ocr engine' => 'gocr',
          'post_save_hook' => '',
          'message' => {
                         'No devices found' => {},
                         'Some pages have not been saved.
Do you really want to quit?' => {},
                         'Warning: missing packages
OCR requires gocr, tesseract, ocropus, or cuneiform
' => {},
                         'Warning: missing packages
Save as DjVu requires djvulibre-bin
OCR requires gocr, tesseract, ocropus, or cuneiform
' => {},
                         'The help viewer requires module Gtk2::Ex::PodViewer
Alternatively, try: gscan2pdf --help

' => {}
                       },
          'author' => undef,
          'allow-batch-flatbed' => '',
          'current_psh' => undef,
          'default profile' => undef,
          'scan prefix' => '',
          'cwd' => '/home/lvdd',
          'thumb panel' => 100,
          'unpaper on scan' => '',
          'window_maximize' => '',
          'author-suggestions' => undef,
          'date offset' => 0,
          'current_udt' => 'gimp %i',
          'visible-scan-options' => {
                                      'calibration-cache' => 1,
                                      'y' => 1,
                                      'resolution' => 1,
                                      'wait-for-button' => 1,
                                      'batch-scan' => 1,
                                      'contrast' => 1,
                                      'compression' => 1,
                                      'brightness' => 1,
                                      'button-wait' => 1,
                                      'source' => 1,
                                      'l' => 1,
                                      't' => 1,
                                      'overscan-top' => 1,
                                      'page-width' => 1,
                                      'overscan-bottom' => 1,
                                      'threshold' => 1,
                                      'adf-mode' => 1,
                                      'pagewidth' => 1,
                                      'adf_mode' => 1,
                                      'gain' => 1,
                                      'mode' => 1,
                                      'page-height' => 1,
                                      'Paper size' => 1,
                                      'x' => 1,
                                      'speed' => 1,
                                      'pageheight' => 1
                                    },
          'unsharp sigma' => 1,
          'adf-defaults-scan-all-pages' => 1,
          'set_timestamp' => '1',
          'image type' => undef,
          'threshold-before-ocr' => '',
          'restore window' => '1',
          'scan_window_height' => 872,
          'tiff compression' => undef,
          'quality' => 75,
          'scan_window_width' => 668,
          'downsample' => '',
          'available-tmp-warning' => '10',
          'user_defined_tools' => [
                                    'gimp %i'
                                  ],
          'window_y' => 31,
          'unsharp threshold' => '0.05',
          'unsharp amount' => 1,
          'rotate facing' => '0',
          'Dark threshold' => '0.12',
          'downsample dpi' => 150,
          'libsane-perl version' => '0.05',
          'profile' => {},
          'keywords' => undef,
          'rotate reverse' => '0',
          'ocr language' => undef,
          'SANE version' => '1.0.25',
          'view files toggle' => '1',
          'OCR on scan' => '',
          'keywords-suggestions' => undef,
          'default-scan-options' => {
                                      'backend' => [
                                                     {
                                                       'mode' => 'Gray'
                                                     },
                                                     {
                                                       'resolution' => '150'
                                                     },
                                                     {
                                                       'br-x' => '210'
                                                     },
                                                     {
                                                       'br-y' => '297'
                                                     }
                                                   ],
                                      'frontend' => {
                                                      'paper' => 'A4'
                                                    }
                                    },
          'unsharp radius' => 0,
          'cache options' => 1,
          'title' => undef,
          'unpaper options' => undef,
          'subject-suggestions' => undef
        };

INFO - Operating system: linux
INFO - 
INFO - NAME="Arch Linux"
PRETTY_NAME="Arch Linux"
ID=arch
ID_LIKE=archlinux
ANSI_COLOR="0;36"
HOME_URL="https://www.archlinux.org/"
SUPPORT_URL="https://bbs.archlinux.org/"
BUG_REPORT_URL="https://bugs.archlinux.org/"

INFO - Perl version v5.24.1
INFO - Glib-Perl version 1.324
INFO - Built for Glib 2.50.2
INFO - Running with Glib 2.50.3
INFO - Gtk2-Perl version 1.2498
INFO - Built for GTK 2.24.30
INFO - Running with GTK 2.24.31
INFO - Gscan2pdf::Document version 1.7.2
INFO - Using GtkImageView version 1.6.4
INFO - Using Gtk2::ImageView version 0.05
INFO - Using PDF::API2 version 2.030
INFO - Using Sane version 1.0.25
INFO - Using libsane-perl version 0.05
DEBUG - $VAR1 = {
          'window_width' => 2002,
          'cache' => undef,
          'Blank threshold' => '0.005',
          'pdf font' => undef,
          'Paper' => {
                       'A4' => {
                                 'y' => 297,
                                 'l' => 0,
                                 't' => 0,
                                 'x' => 210
                               },
                       'US Letter' => {
                                        'x' => 216,
                                        't' => 0,
                                        'l' => 0,
                                        'y' => 279
                                      },
                       'US Legal' => {
                                       'x' => 216,
                                       't' => 0,
                                       'y' => 356,
                                       'l' => 0
                                     }
                     },
          'title-suggestions' => undef,
          'pdf compression' => 'auto',
          'scan-reload-triggers' => [
                                      'mode'
                                    ],
          'threshold tool' => 80,
          'window_x' => 356,
          'OCR output' => 'replace',
          'selection' => undef,
          'frontend' => 'libsane-perl',
          'device' => 'plustek:libusb:003:007',
          'default filename' => '%a %y-%m-%d',
          'cycle sane handle' => '',
          'udt_on_scan' => '',
          'close_dialog_on_save' => 1,
          'subject' => undef,
          'device blacklist' => '',
          'version' => '1.7.2',
          'auto-open-scan-dialog' => '1',
          'window_height' => 1261,
          'message' => {
                         'No devices found' => {},
                         'Some pages have not been saved.
Do you really want to quit?' => {},
                         'Warning: missing packages
OCR requires gocr, tesseract, ocropus, or cuneiform
' => {},
                         'Warning: missing packages
Save as DjVu requires djvulibre-bin
OCR requires gocr, tesseract, ocropus, or cuneiform
' => {},
                         'The help viewer requires module Gtk2::Ex::PodViewer
Alternatively, try: gscan2pdf --help

' => {}
                       },
          'post_save_hook' => '',
          'author' => undef,
          'allow-batch-flatbed' => '',
          'TMPDIR' => undef,
          'ocr engine' => 'gocr',
          'Page range' => 'all',
          'visible-scan-options' => {
                                      'calibration-cache' => 1,
                                      'y' => 1,
                                      'resolution' => 1,
                                      'wait-for-button' => 1,
                                      'batch-scan' => 1,
                                      'contrast' => 1,
                                      'compression' => 1,
                                      'brightness' => 1,
                                      'button-wait' => 1,
                                      'source' => 1,
                                      'l' => 1,
                                      't' => 1,
                                      'overscan-top' => 1,
                                      'page-width' => 1,
                                      'overscan-bottom' => 1,
                                      'threshold' => 1,
                                      'adf-mode' => 1,
                                      'pagewidth' => 1,
                                      'adf_mode' => 1,
                                      'gain' => 1,
                                      'mode' => 1,
                                      'page-height' => 1,
                                      'Paper size' => 1,
                                      'x' => 1,
                                      'speed' => 1,
                                      'pageheight' => 1
                                    },
          'current_udt' => 'gimp %i',
          'author-suggestions' => undef,
          'window_maximize' => '',
          'date offset' => 0,
          'current_psh' => undef,
          'thumb panel' => 100,
          'unpaper on scan' => '',
          'cwd' => '/home/lvdd',
          'scan prefix' => '',
          'default profile' => undef,
          'set_timestamp' => '1',
          'adf-defaults-scan-all-pages' => 1,
          'threshold-before-ocr' => '',
          'image type' => undef,
          'unsharp sigma' => 1,
          'downsample' => '',
          'scan_window_width' => 668,
          'quality' => 75,
          'tiff compression' => undef,
          'available-tmp-warning' => '10',
          'scan_window_height' => 872,
          'restore window' => '1',
          'libsane-perl version' => '0.05',
          'view files toggle' => '1',
          'SANE version' => '1.0.25',
          'ocr language' => undef,
          'rotate reverse' => '0',
          'keywords' => undef,
          'profile' => {},
          'unsharp amount' => 1,
          'unsharp threshold' => '0.05',
          'window_y' => 31,
          'user_defined_tools' => [
                                    'gimp %i'
                                  ],
          'downsample dpi' => 150,
          'Dark threshold' => '0.12',
          'rotate facing' => '0',
          'cache options' => 1,
          'subject-suggestions' => undef,
          'unpaper options' => undef,
          'title' => undef,
          'keywords-suggestions' => undef,
          'OCR on scan' => '',
          'unsharp radius' => 0,
          'default-scan-options' => {
                                      'backend' => [
                                                     {
                                                       'mode' => 'Gray'
                                                     },
                                                     {
                                                       'resolution' => '150'
                                                     },
                                                     {
                                                       'br-x' => '210'
                                                     },
                                                     {
                                                       'br-y' => '297'
                                                     }
                                                   ],
                                      'frontend' => {
                                                      'paper' => 'A4'
                                                    }
                                    }
        };

INFO - which convert
INFO - which scanadf
INFO - which xdg-email
INFO - which gocr
INFO - which tesseract
INFO - tesseract -v
INFO - tesseract '' '' -l ''
INFO - which ocroscript
INFO - which cuneiform
INFO - which cjb2
INFO - which unpaper
INFO - unpaper --version
INFO - which tiffcp
INFO - which pdfunite
INFO - Found Image::Magick
INFO - Found ImageMagick
INFO - Found xdg-email
INFO - Found cjb2 (djvu)
INFO - Found libtiff
INFO - Found pdfunite
INFO - Found unpaper v6.1
INFO - Checking /tmp for crashed sessions
INFO - Using /tmp/gscan2pdf-1RTp for temporary files
DEBUG - Set logger in Gscan2pdf::Dialog::Scan::Sane
DEBUG - Set logger in Gscan2pdf::Dialog::Scan
INFO - Sane->get_devices returned: $VAR1 = [
          {
            'name' => 'plustek:libusb:003:007',
            'type' => 'flatbed scanner',
            'model' => 'CanoScan N670U/N676U/LiDE20',
            'vendor' => 'Canon'
          }
        ];

DEBUG - Started setting device_list from undef to $VAR1 = [
          {
            'name' => 'plustek:libusb:003:007',
            'type' => 'flatbed scanner',
            'model' => 'CanoScan N670U/N676U/LiDE20',
            'vendor' => 'Canon'
          }
        ];

INFO - signal 'changed-device-list' emitted with data: $VAR1 = [
          {
            'label' => 'Canon CanoScan N670U/N676U/LiDE20',
            'name' => 'plustek:libusb:003:007',
            'type' => 'flatbed scanner',
            'model' => 'CanoScan N670U/N676U/LiDE20',
            'vendor' => 'Canon'
          }
        ];

DEBUG - Started setting device from  to plustek:libusb:003:007
INFO - signal 'changed-device' emitted with data: 'plustek:libusb:003:007'
DEBUG - Finished setting device from  to plustek:libusb:003:007
DEBUG - Finished setting device_list from undef to $VAR1 = [
          {
            'name' => 'plustek:libusb:003:007',
            'type' => 'flatbed scanner',
            'model' => 'CanoScan N670U/N676U/LiDE20',
            'vendor' => 'Canon'
          }
        ];

DEBUG - opening device 'plustek:libusb:003:007': Success
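
The log above lists three probes for tesseract (which tesseract, tesseract -v, tesseract '' '' -l '') but no matching "Found" line. A minimal sketch for repeating those probes by hand and printing the exit status and output of each, assuming a POSIX shell. The helper name probe is hypothetical, and the log does not show how gscan2pdf evaluates the results, so this only collects the raw output:

```shell
#!/bin/sh
# Re-run the tesseract probes that appear in the gscan2pdf log and show
# what each one returns. This does not reproduce gscan2pdf's own parsing.

# probe CMD [ARGS...]: run a command, then print its exit status and its
# combined stdout/stderr (hypothetical helper, not part of gscan2pdf).
probe() {
    printf '%s\n' "--- $*"
    output=$("$@" 2>&1)
    status=$?
    printf 'exit status: %s\n' "$status"
    printf '%s\n' "$output"
}

# The same three commands, in the order they appear in the log.
probe which tesseract
probe tesseract -v
probe tesseract '' '' -l ''
```

Comparing this output on a system where gscan2pdf does detect tesseract may help decide whether the report belongs with the AUR package or upstream.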

Thanks
lvdd

Last edited by lvdd (2017-02-20 21:14:22)


#2 2017-03-20 13:52:11

Markus00000
Member
Registered: 2011-03-27
Posts: 318

Re: Tesseract not recognized by gscan2pdf 1.7.2-1
