Using the Minidx Extract-Text COM component to read text content from doc, xls, pdf, and other files: VC Demo - Full-text Search Blog

by minidxer on 2008-01-10 01:01:42

The article "Using the Minidx Extract-Text COM component to read text content from Word, Xls, Pdf... files" explains how to use the Minidx Extract-Text COM component from VB.NET to extract text from various file types such as Word, Excel, and PDF. Since then, many readers have emailed me asking how to call the component from C++. Gmail flags some of these emails as spam, so I strongly recommend leaving a comment under the article or asking your question here instead; that reduces my workload and saves me from replying to each person individually. I took some time to put together a VC Demo, built with VS2005 as a Unicode build. Below is a brief explanation of the Demo. For the basic principles, refer to the article mentioned above; I won't repeat them here.