Question:
What is the best way to index thousands of documents for use in an ASP.NET 2.0 web site?
Meridian Q
2007-04-12 12:31:26 UTC
I want to be able to add and delete documents (mainly PDFs) and allow my users to add documents in a members-based service. What is the best way to index these documents as they are uploaded or donated, or where they already exist in certain folders? Basically I am looking for a program of some sort that indexes them into a SQL DB or XML file, so that I can then write code to search the indexed information and pull up the correct documents. The basic need is a tool that indexes the content, giving me a DB to query by keyword so I can show the documents most relevant to the search text. Is there a program or plug-in of some kind that already indexes PDFs etc. and places the result into a DB or XML format, so I can use it to build my own custom search script and results layout that lets users open the docs they need?
Four answers:
jake cigar™ is retired
2007-04-12 15:02:02 UTC
Google and others do it! Use Google on your own server!
Smutty
2007-04-16 01:39:37 UTC
This is a wide topic.



The documents can be stored in a variety of ways. One of them is storing them as binary data in MS SQL Server. This has advantages and disadvantages. The advantages are security and SQL Server's transaction support. The disadvantage is that retrieval will be slow: SQL Server stores data in 8 KB pages, so reading a document back out of the database generates a lot of I/O.



Another approach is to store the documents on an FTP server and simply store references to your documents in the DB. FTP servers are much faster, and they are built exactly to support file transfer.
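
To make the "references in the DB" idea concrete, here is a minimal sketch of a keyword index that stores only each document's path plus the words found in it. It is written in Python with SQLite purely for illustration; on an ASP.NET site the same two tables could live in SQL Server. The table names, function names, and the simple hit-count ranking are all assumptions made for this example, and the text of each PDF is assumed to have been extracted already by some other tool.

```python
import re
import sqlite3
from collections import Counter


def open_index(db_path=":memory:"):
    """Create (or open) the index database. Schema is illustrative only."""
    conn = sqlite3.connect(db_path)
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS documents (
            id   INTEGER PRIMARY KEY,
            path TEXT UNIQUE NOT NULL      -- reference to the file, not the file itself
        );
        CREATE TABLE IF NOT EXISTS postings (
            term   TEXT    NOT NULL,
            doc_id INTEGER NOT NULL REFERENCES documents(id),
            hits   INTEGER NOT NULL,       -- how often the term occurs in the document
            PRIMARY KEY (term, doc_id)
        );
    """)
    return conn


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_document(conn, path):
    """Drop a document and its index entries, if it is indexed."""
    row = conn.execute("SELECT id FROM documents WHERE path = ?", (path,)).fetchone()
    if row:
        conn.execute("DELETE FROM postings WHERE doc_id = ?", (row[0],))
        conn.execute("DELETE FROM documents WHERE id = ?", (row[0],))
        conn.commit()


def add_document(conn, path, text):
    """Index already-extracted text under the given file path."""
    remove_document(conn, path)  # re-indexing replaces the old entry
    doc_id = conn.execute(
        "INSERT INTO documents (path) VALUES (?)", (path,)
    ).lastrowid
    counts = Counter(tokenize(text))
    conn.executemany(
        "INSERT INTO postings (term, doc_id, hits) VALUES (?, ?, ?)",
        [(term, doc_id, n) for term, n in counts.items()],
    )
    conn.commit()
    return doc_id


def search(conn, query):
    """Return paths of matching documents, highest total hit count first."""
    terms = sorted(set(tokenize(query)))
    if not terms:
        return []
    marks = ",".join("?" * len(terms))
    rows = conn.execute(
        "SELECT d.path, SUM(p.hits) AS score "
        "FROM postings p JOIN documents d ON d.id = p.doc_id "
        f"WHERE p.term IN ({marks}) "
        "GROUP BY d.id, d.path ORDER BY score DESC, d.path",
        terms,
    ).fetchall()
    return [path for path, _score in rows]
```

Calling `add_document` when a file is uploaded and `remove_document` when it is deleted keeps the index in step with the folder, and `search` returns the paths to link to in the results page. Raw hit counts are only a crude relevance measure; a real site would probably want something better.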



There are lots of products that offer document management and workflow management. You can also try using one of them instead of reinventing the wheel.



Hope this helps.
Flying_Bears_are_Cool
2007-04-12 12:38:28 UTC
Microsoft Indexing Service, a default service found on all Windows Server 2003 OS packages, would be the best way to go.
stlouis
2016-11-23 20:12:41 UTC
i take advantage of My area plenty, it made a dream come real, as I met a musician who made numerous rock songs to my lyrics. I never use it as a social community in words of contacting acquaintances etc... better as a community for assembly old and new famous bands and also a communicate board for authors and books. i'm at the moment writing a e book so the enter from different authors is rather cool for me. I also like Imeem considering the fact that Smiley invited me to affix it. I actually have only some play lists so a strategies, yet they form of characterize me as an eclectic music lover and that i have uploaded many songs from my own archives. I actually have lists that decision from Classical music to Disco/Trance, and Rock/metallic/Jazz.....yet i need to admit I do spend quite some time listening to Smiley's and could's lists too! they're only fantastic. at the same time as I pay interest to all that music.......I browse cyberspace and seem at for my e book in information superhighway sites like medical American which i'm subscribed to. MQ: New fave bands.....OPETH and The Lizards.....ohhh and that i'm virtually smitten by the cover album that Vanilla Fudge made up of Led Zeppelin's songs. thanks Smiley, as i'm able to finally savour LZ !


This content was originally posted on Y! Answers, a Q&A website that shut down in 2021.