dtSearch Expands File Parsers and Converters; Content Extraction Only Licenses Available
New release also broadens API access to "stored fields," enhancing search filter and data classification options for databases and other fielded data
Online, April 12, 2010 (Newswire.com) - dtSearch Corp., a leading supplier of enterprise and developer text retrieval software, announces a new release of its enterprise and developer product line, including the dtSearch Engine. The dtSearch Engine for Win & .NET (32-bit & 64-bit) and the dtSearch Engine for Linux (32-bit & 64-bit) let developers add instant text searching and built-in file format and other data support to a wide range of Internet, Intranet and other commercial applications.
File format expansions. Responding to increased interest from developers in file parser and converter licenses, even for applications that do not involve search, dtSearch Corp. has been extending its proprietary parsers and converters. The new release includes a new XML-based conversions format to provide better access for developers to document structures, such as document properties, nested attachments, and internal structural elements (like spreadsheets inside of documents).
The new release also broadens the list of supported file types. The file parsers and converters now cover Adobe Framemaker MIF, XFA form templates, and Visio XML, in addition to existing supported file types like HTML, PDF, XSL/XML, ZIP, OpenOffice and MS Office files (through current released versions).
Fielded data enhancements. The new release also provides broader API access to "stored fields." The dtSearch Engine can generate stored fields from databases like SQL (including BLOB data).
Spider. The Spider (included with most dtSearch products and as a .NET API in the dtSearch Engine) adds local or remote web content to a searchable data collection. Supported content can be static or dynamic (ASP.NET, PHP, SharePoint, etc.).
Terabyte indexer. dtSearch products can index over a terabyte of text in a single index, as well as create and simultaneously search an unlimited number of indexes.
International language support. Built-in Unicode support covers hundreds of international languages (including right-to-left languages and Chinese/Japanese/Korean character processing options.
Other search features. Full-text and fielded data search options include: distributed or federated search options with integrated hit-highlighted display, fuzziness adjustable from 0 to 10 (to sift through typographical and spelling errors), synonym/concept/thesaurus (through a built-in thesaurus and/or user-defined synonym rings), boolean (and/or/not), phrase, phonic, wildcard, bilateral proximity, directed proximity, stemming, natural language/vector-space relevancy ranking, variable term weighting, positional scoring, field-based relevancy ranking, data classification and filtering objects, field value enumeration, numeric range searching, advanced date recognition, regular expression, unindexed search (in addition to indexed search), and special forensics search options (text filtering of forensically-recovered data, credit card search, email search, etc.).
The core of the dtSearch product line, the dtSearch Engine for Win & .NET supports C++, Java and .NET, including a .NET Spider API. The dtSearch Engine for Linux supports C++ and Java. Both platforms include native 32-bit and 64-bit builds.
More information. For more information, or to download fully-functional evaluations, please call 1-800-IT-FINDS (or 301/263-0731), email [email protected] or visit www.dtsearch.com.
# # #
About dtSearch, www.dtsearch.com
The Smart Choice for Text Retrieval® since 1991, dtSearch offers 19 years of experience in text search. The dtSearch product line includes enterprise and developer text search products, meeting some of the largest-capacity text retrieval needs in the world. dtSearch products have received hundreds of excellent press reviews and case studies. (Please see www.dtsearch.com for these.) The company has distributors worldwide, including coverage on six continents.
Share:
Tags: 64 bit, file parser, search