It is typically used in organizations where high accuracy and precision is required for search and retrieval such as Defense and Intelligence, legal & eDiscovery and fee based web research services. Typically deployed in a SharePoint or non-SharePoint environment where the bundled products and features are required.
Are highly accurate, precise, and relevant search results a business requirement?
Worried about the scalability performance and integration of search and classification products?
Is the ability to generate and index compound term metadata in real-time from all sources of value to you?
How important is product selection based on an open XML and web services architecture?
The platform includes the core technologies including automatic compound term metadata generation, auto-classification, and conceptTaxonomyManager. conceptSearch provides scalable enterprise search, conceptSQL indexes and classifies SQL or Oracle databases, Published API’s and the SharePoint feature sets enable integration with SharePoint and other content repositories.Core Technology White Paper
The platform is an industry unique enterprise Search and classification technology used by Government and commercial organizations as well as OEM platforms, third party integrators, and software vendors.The combination of components and plug and play technology expedites the integration into line of business applications.Technology Benefits White Paper
This table provides an overview of all Concept Searching Platforms and the components for each platform.
The following table illustrates the functionality and features available in the Concept Searching Technology platform. All technology platforms share the same core functionality. The most significant difference in the Concept Searching Technology platform is the inclusion of our enterprise search engine, conceptSearch and conceptSQL. Although the SharePoint Feature set is available with the Concept Searching Technology platform, it is frequently used in a non SharePoint environment or for those who prefer our enterprise search engine for scalability, performance, and precision searching.
For more information on a specific feature please use the hover function over the specific Feature/Function.
The Concept Searching Technology Platform is based on an open architecture with all APIs based on XML and Web Services. Transparent access to system internals including the statistical profile of terms, is standard. The base platform is installed as a feature set and comprises the following components.
conceptSearch is an enterprise search engine based on a unique, language independent technology. Unlike other enterprise search engines, which require significant customization with marginal results, conceptSearch is delivered as an out-of-the-box application that demonstrates a simple search interface and indexing facilities for internal content, web sites, file systems and XML documents. Application developers experience a minimal learning curve and the organization achieves a rapid return on investment.
conceptClassifier is a leading-edge rules based categorization module providing our clients complete control of rules-based descriptors unique to their organizations. conceptClassifier delivers a categorization descriptor table, which is easy to implement and maintain, through which all rules and terms can be defined and managed. This approach eliminates the error-prone results of ‘training’ algorithms typically found in other text retrieval solutions and enables human intervention to effectively tune classification results.
This is an advanced enterprise class, easy-to-use taxonomy development and management tool, still unique in the industry. Developed on the premise that a taxonomy solution should be used by business professionals, and not the IT team or librarians, the end result is a highly interactive and powerful tool that has been proven to reduce taxonomy development by up to 80% (client source data).
This product provides the ability to define a document structure based on information held in a Microsoft SQL Server. A document can include any number of text and metadata fields and can span multiple tables if required. conceptSQL supports SQL 2005, 2008, and 2012. A powerful but easy to use configuration tool is supplied eliminating the need for any programming. Templates are provided for out of the box support for Documentum, Hummingbird, and Worksite/Interwoven DMS.
The SharePoint Feature Set includes the following components: farm solution with feature sets, Term Store integration, taxonomy tree control for editing, refinement panel integration, event handlers for notification of changes, management of classification status column, web service advanced functionality (implement system update or preserve GUIDS), automated site column creation.
Provides scalability to accommodate size of end user community.
Provides scalability of classification to increase speed of classification throughput especially when classification on the fly is an important requirement.
All the functions you need to start gaining control over your unstructured content are included in the base platforms. Our clients have discovered the unique and varied uses of the technology to solve a wide variety of content management challenges. Below is a list of the base platform and optional products that are needed to solve your particular business process challenge and leverage your technology investment.
Why wait? Improve your business processes and positively impact your bottom line starting today.
With the exponential increase in unstructured information, enterprises are seeking new ways to improve not only the search and retrieval process but to identify tools to manage, capitalize on, and leverage their information assets to improve organizational performance. Moving beyond keyword metadata and traditional taxonomy approaches, the use of compound term processing, or identifying ‘concepts in context’ effectively addresses the issue of managing unstructured content and enables organizations to more effectively find, organize, and manage their information capital.
Based on industry unique compound term processing
Ability to auto-classify content from diverse internal and external repositories
Powerful and still industry unique enterprise search engine, conceptSearch
Ability to generate semantic metadata and surface it to any search engine to improve search results
Ability to automatically tag content with vocabulary or retention codes for records management
Ability to provide intelligent migration capabilities based on the semantic metadata within content, identify previously undeclared documents of record, unidentified privacy exposures, or information that should be archived or deleted
Ability to cleanse data to be used in text analytics by identifying relevant, accurate information and identifying previously undeclared records or privacy data that should be exempt from the text analytics process
Ability to provide granular and structured identification of people, content recommendations, and organizational knowledge assets
Concept Searching’s technologies still have not been replicated in the marketplace. The technologies are unique, language independent, and the first content retrieval solution to integrate relevance ranking based on Bayesian Inference Probabilistic Model and concept identification based on Shannon’s Information Theory. The key features include:
The real value of your investment includes both technology and the demonstrable ROI that can be generated from improving business processes. The Concept Searching Technology platform has been deployed by clients to solve individual or multiple challenges including:
Concept Searching has a current Enterprise Authority to Operate (ATO) US Air Force, a current Enterprise Certificate of Networthiness (CoN) US Army, and has been deployed on the SIPR, NIPR, and DISA networks.
Concept Searching’s industry unique compound term processing technology delivers outcomes that are not achieved by any other classification engine. Compound term processing means that Concept Searching’s statistical engine can understand, out-of-the-box, the incremental value of keywords, multi-word fragments, and compound terms. As a result, it can identify concepts resident within an organization’s own information repositories that are highly correlated to particular topics. With the identification of these highly correlated topics in the form of keywords, multi-word fragments and compound terms the result is automatically generated intelligent metadata that is unique to the organization. By using these compound terms in any application that requires metadata, the outcomes are highly accurate, because the ambiguity inherent in single words is no longer an issue.
A search for ‘triple heart bypass’ will locate documents about this topic even if this precise phrase is not contained in any document. A concept search using compound term processing can extract the key concepts, in this case “triple heart bypass” and use these concepts to retrieve relevant documents containing concepts such as ‘heart surgery’, ‘coronary artery bypass’, or ‘open heart surgery’.
conceptTaxonomyManager remains unique in the industry in features that provide the ability to rapidly and easily change the taxonomy as the organizational needs and requirements change. This is important as a taxonomy must remain fluid as opposed to static and must be managed in a way that facilitates change. The easy to use taxonomy and automatic classification tools create the framework to classify content based on concepts to one or more nodes in the taxonomy or multiple taxonomies. The conceptTaxonomyManager component is included in the base product in all platforms.
conceptTaxonomyManager is a simple yet powerful tool with an intuitive user interface designed for Subject Matter Experts (SME) without the need for IT, Information Scientists, or specific application skills, to build, maintain and validate taxonomies for the enterprise. This feature has been shown to reduce taxonomy development by up to 80% (client source data).
Eliminating complex Boolean rules and the need for training sets the taxonomy nodes can be automatically generated from the compound terms found in the document corpus. The Subject Matter Expert (SME) has full control of the terms to be used as well as the weighting of the term based on its relevancy. This enables a much more robust taxonomy as the terms are suggested based on the organization’s own content and can offer the SME new terms from the relevant documents that may not have been identified. The Clues can also be assigned a score or weight, either positive or negative to improve the classification. Clues can also be assigned a Type. Types include standard, case-sensitive, metadata, phonetic, and RegEx (Regular Expression).
Automatic document movement feedback enables the SME to see the cause and effect on changing the clue weightings for a node in the taxonomy. The user can also search within the refined node and bring back documents from the whole corpus now classified against the node. The system will indicate if the change has increased the score, reduced the score as well as identify documents that will no longer be classified and the new documents that will be classified.
This feature is a requirement for organizations that have many taxonomy operators, extremely large collections of documents, and where taxonomy management is a critical business process. This feature can be implemented on any number of servers and several taxonomy managers can be assigned to a server to ensure the level of throughput needed. Real time locking mechanisms are used to make nodes of the taxonomy inaccessible to other taxonomy managers while the node is being edited. The taxonomy managers can visually see when a node is locked and who has locked it as well as when it becomes available. The Distributed Taxonomy Management feature is totally transparent to the end user and all locking and unlocking of the nodes by the taxonomy managers are coordinated by the central server.
The product provides a full security model enabling lock down of nodes, branches, and complete taxonomies to particular users and/or groups of users. Also supports rollback to the previous state.
conceptTaxonomyWorkflow is a powerful add-on product to automate manual business processes. conceptTaxonomyWorkflow serves as a strategic tool managing migration activities and content type application across multiple SharePoint and non-SharePoint farms and is also platform agnostic. This add-on component delivers value specifically in migration, data privacy, records management or any application or business process that requires workflow capabilities. It is required to apply an action on a document and optionally, automatically apply a content type and route to the appropriate repository for disposition.
Concept Searching is the only available solution that addresses the challenges in managing unstructured and semi-structured data in SharePoint and non SharePoint environments.
Our intelligent metadata enabled solutions address the following challenges with one set of technologies, leverages an enterprises investment in SharePoint, and reduces resources to maintain and manage the solution.
Enterprise Search enables concept based searching by providing the search engine index with the compound terms and semantic metadata.
Records Identification is improved through the elimination of end user tagging, automatic declaration of records, and taxonomy workflow capabilities.
Data Privacy and the protection of confidential data as defined by the organization is identified in real-time and routed to a secure repository for disposition.
Intelligent Migration is accomplished through the identification and auto-classification of the unstructured or semi-structured data to one or more taxonomies.
Text Analytics can be performed to extract organizationally defined descriptors and concepts from diverse repositories.
Social Networking and collaboration applications gain structure and the ability to retrieve highly relevant and granular information.
eDiscovery and FOIA time and costs are reduced through identification of highly detailed information and conceptually similar information that typically would not be found.