dtSearch Version 2021.02 Beta Debuts
August 19, 2021
has developed new versions of its enterprise and developer text
retrieval product line to instantly search terabytes of online
and offline data, spanning multiple folders, emails including
attachments and nested attachments, online data and other
The beta has a preview multithreaded indexer to greatly
increase indexing speed on multicore 64-bit Windows systems.
The beta also adds Hancom Office HWPX support to the many data
types covered by dtSearch's proprietary document filters.
The dtSearch Engine for macOS release candidate adds support for
Apple silicon M1/ARM; the dtSearch Engine developer SDKs for
macOS, Linux and Windows share cross-platform .NET 5 / .NET
Core, C++ and Java APIs for use in developing both "on premises"
and online applications, including on Azure and AWS.
With the new updates, key features of the dtSearch product
line are as follows.
Terabyte Indexer. dtSearch enterprise and developer products
can index a terabyte of text encompassing multiple folders,
emails with nested attachments, online data and other databases
in a single index. The products can create and search any number
of indexes. Index updates do not prevent continued searching.
The multithreaded 64-bit indexer speed boost applies both to new
index builds and to index updates. (The multithreaded indexer
does not involve any change to the index format.)
Concurrent, Multithreaded Searching. Indexed search covering
full-text and metadata is typically instantaneous, even in a
concurrent search environment encompassing terabytes of mixed
online and offline data. For online use, dtSearch products have
no limits on the number of concurrent search threads. The new
multithreaded indexer does not affect searching.
Document Filters and Supported Data Types. dtSearch's
proprietary document filters support Microsoft Office files,
OpenOffice files, PDFs, compression formats, emails along with
nested attachments, web-ready data, and more. For supported data
types, the document filters further support browser display with
highlighted hits. The beta also expands Hancom (a popular Korean
Office application) support to cover the HWPX document format.
Search Options. The dtSearch product line has over 25 full-text
and metadata hit-highlighted search options, with integrated
relevancy ranking across multiple data repositories.
Forensics-oriented options include identifying credit card
numbers in data and hash value generation and search. For
international languages, dtSearch products support Unicode, with
support for right-to-left languages, and special
Chinese/Japanese/Korean character options (covering text in the
HWPX format as well as in other supported formats).
Faceted Search and Other Data Classification. The dtSearch
Engine SDKs make available advanced data classification options
like faceted search and granular data classification based on
document full-text contents, internal document metadata,
database content, or data attributes associated with documents
during document indexing.
SDKs. The SDKs offer developers all of dtSearch's general search
features plus developer-focused features like faceted search and
granular data classification as well as providing access to
dtSearch's document filters. The release candidate extends the
SDKs cross-platform .NET 5 / .NET Core, C++ and Java APIs for
Windows, Linux and macOS to encompass Apple Silicon (M1, Arm).
The dtSearch Engine can operate in applications running locally
or in a cloud environment like Azure or AWS.