Win 8.1 file content indexing

Andrewgerm

Good day all

I have an issue at a client that has thousands of invoices stored in folders, in PDF format. They often need to look up invoices by invoice number or by other details, such as a vehicle registration number.

I have added these folders as locations to be indexed, installed the latest Adobe Reader, made sure the IFilter value in the registry is the correct one, and confirmed that it is listed under the file types to index. PDFs are set to have their contents indexed.

PDFs are created by printing to the Bullzip printer driver.
This results in a PDF with textual content, laid out as a few blocks of text (like an invoice) holding customer details, job details, and contact details.

The problem arises when they search for an alphanumeric string, such as a vehicle registration number: the results seem to include matches where text from different blocks has been joined together. For example, part of a registration number and part of a number from another area of the file are combined, and the combined string then matches the search criteria.

This doesn't make for very helpful results.
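
To illustrate the symptom described above, here is a minimal Python sketch. It assumes, without any confirmation of how the Adobe IFilter or Windows Search actually behaves, that text blocks are handed to the indexer with no separator between them. The block contents and the helper names (tokens, index_blocks) are made up for the example.

```python
import re

def tokens(text):
    """Split text into alphanumeric tokens, roughly as a word breaker might."""
    return re.findall(r"[A-Za-z0-9]+", text)

def index_blocks(blocks, separator):
    """Join the text blocks with the given separator and return the token set."""
    return set(tokens(separator.join(blocks)))

# Two hypothetical text blocks from one invoice: a registration and an address.
blocks = ["REG CA123", "456 Main Road"]

glued = index_blocks(blocks, "")     # blocks run together (the suspected fault)
spaced = index_blocks(blocks, "\n")  # blocks kept apart

print("CA123456" in glued)   # a token that exists in neither block on its own
print("CA123456" in spaced)
```

If the blocks are glued together, a search for CA123456 finds a token that appears nowhere in the invoice as printed. With a separator between blocks, the same search finds nothing.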

Could this be caused by something wrong in the way search is set up, or in the way the files are created?
This all worked perfectly when they were running it all on Windows XP.

Any input would be a great help.
Thank you in advance.
 

My Computer

System One

  • OS
    Windows Vista x64, Ubuntu 14.04, Android 4.2.2
Are they against using a third-party app? If not, take a look at Agent Ransack.
 

My Computer

System One

  • OS
    Windows 10 Pro X64
    Computer type
    PC/Desktop
    System Manufacturer/Model
    Lenovo IdeaCenter K450
    CPU
    Intel Quad Core i7-4770 @ 3.4Ghz
    Motherboard
    Lenovo
    Memory
    16.0GB PC3-12800 DDR3 SDRAM 1600 MHz
    Graphics Card(s)
    Intel Integrated HD Graphics
    Sound Card
    Realtek HD Audio
    Monitor(s) Displays
    HP h2207
    Screen Resolution
    1680x1050@59Hz
    Hard Drives
    250GB Samsung EVO SATA-3 SSD;
    2TB Seagate ST2000DM001 SATA-2;
    1.5TB Seagate ST3150041AS SATA
    PSU
    500W
    Keyboard
    Wired USB
    Mouse
    Wired USB
    Internet Speed
    3GB Up, 30GB Down
    Browser
    SeaMonkey
    Antivirus
    Windows Defender; MBAM Pro
    Other Info
    UEFI/GPT
    PLDS DVD-RW DH16AERSH
@striker
Thank you for the suggestion.

They'd just be happy to have high-speed searching with an index.
I will do some tests with that, and advise them.

My worry is that the output of their third-party PDF printer will not be indexed correctly on Windows, at least on 8.1.

Will post my findings.
 

Agent Ransack doesn't use an index. It interrogates the files directly. That is a little slower than an indexed search, but at least it works.
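
For anyone wondering what searching without an index looks like in general, here is a rough Python sketch. It is not how Agent Ransack is implemented; it only shows the idea of opening every file at search time. The names direct_search and read_as_text are made up for the example. Reading a PDF as plain text will usually miss content, because PDF text streams are often compressed, so in practice a real PDF text extractor would be passed in as extract_text.

```python
import os

def read_as_text(path):
    """Naive reader: treat the file as plain text and ignore undecodable bytes."""
    with open(path, "r", encoding="utf-8", errors="ignore") as handle:
        return handle.read()

def direct_search(root, term, extract_text=read_as_text):
    """Walk root and return the files whose text contains term (case-insensitive)."""
    needle = term.lower()
    hits = []
    for folder, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(folder, name)
            try:
                text = extract_text(path)
            except OSError:
                continue  # unreadable file: skip it and carry on
            if needle in text.lower():
                hits.append(path)
    return sorted(hits)
```

A call such as direct_search(r"C:\Invoices", "CA123") (the folder is a placeholder) reads every file on each search, which is why this approach is slower than an index but does not depend on what the indexer extracted.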
 
