New Forum | Previous | Next | (P-PDF) Acrobat 6.0
Topic: Migrating to Catalog 6
Conf: (P-PDF) Acrobat 6.0, Msg: 99161
Date: 10/26/2003 04:28 PM
I work with a collection of PDF files -- currently about 25,000 files (=150,000 pages=300 MB). The files are accessed from CD. Sometimes users need to full-text search only a portion of the document collection, sometimes over the whole collection. I have used Acrobat 4 Catalog to build 20 indexes covering the entire document collection. Most users are on Acrobat 4 or 5 but will move to 6 in the next two years.
The new Acrobat 6 Catalog will require me to change the way I am doing certain things. I am looking for confirmation that I'm going in the right direction.
4 index vs 6 index: I know users on 4 and 5 must have an index from that era. Even though 6 can read indexes from previous versions, I have read that 6 users will see advantages to using a version 6 index. Is this true? If so, I suppose creating indexes in both versions is the best option.
Index Size: My initial test on 4,200 files (=25,000 pages=60 MB), version 4 produces a 15 MB index and version 6 produces a 50 MB index. Excluding numbers and certain words gets it down to 45 MB, but this isn't a good option with this particular collection of documents. Adobe tech support says the much larger index size is expected. But Catalog knowledge didn't seem to be the forte of the two people with whom I spoke. Is this, in fact, a typical index size from version 6? Very little compression compared to the PDF files being indexed?
Index Set-up: Users, in my experience, have difficulty mounting indexes. In version 4, I built a kind of "home page" for the document collection. It wasn't the most elegant thing in the world, but it let the user mount multiple indexes, select the index(es) to search, and open the query box without knowing much about Acrobat. In version 6, it is still possible to auto-mount an index when a file is opened. But it looks to me like the first index "un-mounts" when a second index is auto-mounted. So the only way to search across multiple indexes is to manually connect to them -- which requires me to explain to the user where the indexes are and how to mount them. And, when manually mounted, the indexes stay mounted until the user deliberately removes them (which can cause confusion if the user searches a lot of different indexes). What is the best way to handle this situation?
Thanks in advance for any advice,