conceptClassifier Platform

Overview

conceptClassifier is Concept Searching’s technology platform for non-SharePoint environments, deployed using our published APIs. The technology is used in heterogeneous environments, and the APIs supply the automatically generated semantic metadata to any search engine index to enable concept-based searching. The APIs are also used to integrate with any other system or application that uses metadata, such as content management systems and applications for records management, data protection, and migration.

The conceptClassifier platform is based on our Smart Content Framework™ for information governance and incorporates best practices for developing an enterprise framework to mitigate risk, automate processes, manage information, protect privacy, and address compliance issues. Underlying the framework is the technology to:

The framework is being used to enable Intelligent Metadata Enabled Solutions to improve search, records management, enterprise metadata management, text analytics, migration, enterprise social networking, and data security.

Incorporating our industry-recognized Smart Content Framework™ and our intelligent metadata enabled solutions, conceptClassifier provides the technology for organizations that want to implement automatic semantic metadata generation, auto-classification, and taxonomy management in a non-SharePoint environment.

When to Use the conceptClassifier Platform?

  • Need the ability to automatically generate semantic metadata, auto-classify content, and manage taxonomies
  • Need to deploy on-premises, in the cloud, or in a hybrid environment
  • Deploying in a non-SharePoint environment
  • Need integration with a third-party search engine to be used internally or on an extranet, web site, or self-service customer portal
  • Need the ability to build taxonomies
  • Need the ability to auto-classify content from diverse repositories
  • Need integration with third-party applications or connections to diverse repositories
  • Want to take advantage of APIs, custom controls, and demonstration source code
  • On-the-fly classification of content is a priority
  • Performance and scalability are key considerations
  • A technology-agnostic platform is required

What are the key outcomes?

The key outcomes depend on the organization’s objectives. Organizations are using the conceptClassifier platform to:

  • Integrate with any search engine
  • Integrate with other repositories
  • Integrate with any application that requires the use of metadata
  • Eliminate manual tagging
  • Improve enterprise search
  • Facilitate records management
  • Detect and automatically secure unknown privacy exposures
  • Intelligently migrate content
  • Enhance eDiscovery, litigation support, and FOIA requests
  • Enable text analytics
  • Provide structure to enterprise social networking applications

Platform Matrix

This table provides an overview of all platforms and the components for each platform.

| Core Components | conceptClassifier for SharePoint Platform | | | conceptClassifier for Office 365 Platform | conceptClassifier Platform | Concept Searching Technology Platform |
| | conceptClassifier for SharePoint 2013 | conceptClassifier for SharePoint 2010 | conceptClassifier for SharePoint 2007 | | | |
|---|---|---|---|---|---|---|
| Compound Term Processing Engine – licensed for concept extraction only | yes | yes | yes | yes | yes | Full search functionality included |
| conceptClassifier | yes | yes | yes | yes | yes | yes |
| conceptTaxonomyManager | yes | yes | yes | yes | yes | yes |
| conceptSearch | no | no | no | no | no | yes |
| SharePoint Feature Set | yes | yes | yes | yes | no | yes |
| SharePoint Connector | yes | yes | yes | yes | no | yes |
| APIs, custom controls, demonstration source code | no | no | no | no | yes | yes |
| conceptSQL | no | no | no | no | no | yes |
| Proprietary controls for SharePoint 2007 | no | no | yes | no | no | yes |

| Optional Components | conceptClassifier for SharePoint Platform | | | conceptClassifier for Office 365 Platform | conceptClassifier Platform | Concept Searching Technology Platform |
| | conceptClassifier for SharePoint 2013 | conceptClassifier for SharePoint 2010 | conceptClassifier for SharePoint 2007 | | | |
|---|---|---|---|---|---|---|
| conceptTaxonomyWorkflow | yes | yes | no | yes | yes | yes |
| conceptSearch | yes | yes | yes | yes | yes | Included in Base Product |
| conceptSQL | yes | yes | yes | yes | yes | Included in Base Product |
| Content Enrichment Service for SharePoint 2013 | yes | no | no | no | no | yes |
| FAST Pipeline Stage for SharePoint 2010 | yes | yes | no | no | no | yes |
| conceptClassifier for OneDrive for Business | yes | no | no | yes | no | no |
| Additional Classification Servers | yes | yes | yes | yes | yes | yes |
| Additional Front End Web Servers | yes | yes | yes | N/A | yes | yes |

Features

The conceptClassifier platform is technology agnostic. The table below illustrates the functionality and features available.

Feature/Function conceptClassifier  Platform
Taxonomy Functionality
SOA Compliant yes
Industry Standards yes
Taxonomy Rollback yes
Supports 55 Languages yes
Text mining to identify candidate terms yes
Boosting yes
Relationship Definition yes
Calculations Feedback yes
Concept Mapping yes
Taxonomy Navigation yes
Import, combine, organize and harmonize taxonomy models yes
Polyhierarchy Support yes
Folksonomy Support yes
Distributed taxonomy management yes
Synonym support yes
Instant feedback on taxonomy changes, dynamic screen updating yes
Controlled Vocabularies yes
Ability to automatically suggest classification clues for taxonomies yes
Security yes
Managed by Subject Matter Experts yes
Highly Scalable yes
Rapidly Deployed yes
Classification
Manual Tagging yes
Automatic tagging of content from diverse repositories, web sites, content management systems yes
Classification as content is created or ingested yes
Classification to one or more nodes in one or more taxonomies yes
Classifies all unstructured and semi-structured content from libraries, blogs, wikis, and threads lists yes
Rich Web indexing support yes
Compound Term Indexing Engine embedded within conceptClassifier and conceptTaxonomyManager yes
Automatic generation of compound terms yes
Intelligent Metadata Enabled Solutions
Records Identification yes
Data privacy and information security yes
Taxonomy workflow yes
Intelligent migration yes

Technology Specifications

The conceptClassifier Platform is based on an open architecture, with all APIs based on XML and Web Services. Transparent access to system internals, including the statistical profile of terms, is standard. The base platform is installed as a feature set and comprises the following components.

Base Components in conceptClassifier Platform

  • conceptClassifier

Both automated and manual classification are supported, to one or more taxonomies.

  • conceptTaxonomyManager

conceptTaxonomyManager is an advanced, enterprise-class, easy-to-use taxonomy development and management tool that is still unique in the industry. Developed on the premise that a taxonomy solution should be used by business professionals, not IT staff or librarians, it is a highly interactive and powerful tool that has been proven to reduce taxonomy development by up to 80%.

  • Compound Term Processing Engine

Licensed for the sole use of building and refining the taxonomy, the engine provides automatic semantic metadata generation that extracts multi-word terms or concepts along with keywords and acronyms.

Typical Recommended Base Configuration

  • Windows Server 2008/2012 with IIS
  • Modern 64-bit CPUs (ideally at least 8 cores)
  • 8GB RAM (recommended)
  • .NET Framework v4.0 or v4.5
  • Access to SQL Server (2005 or later) or Oracle (10g R2 or later)
  • IIS 6 with MetaBase enabled
  • Microsoft Office 2010 64-bit iFilter pack
  • High speed disk, RAID array, or SAN

One Farm

  • 1 conceptClassifier Server

Optional Products

conceptSQL

This product provides the ability to define a document structure based on information held in a Microsoft SQL Server or Oracle database. A document can include any number of text and metadata fields and can span multiple tables if required. conceptSQL supports SQL Server 2005, 2008, and 2012. A powerful but easy-to-use configuration tool is supplied, eliminating the need for any programming. Templates are provided for out-of-the-box support for Documentum, Hummingbird, and Worksite/Interwoven DMS.
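
conceptSQL itself is set up through its configuration tool, with no programming required. Purely to illustrate the idea of a single document assembled from text and metadata fields that span multiple tables, here is a minimal Python sketch. It uses an in-memory SQLite database as a stand-in for SQL Server or Oracle, and the table names, column names, and output shape are all hypothetical, not part of the product.

```python
import sqlite3


def build_documents(conn):
    """Assemble one logical document per case from two related tables.

    Metadata fields come from the (hypothetical) 'cases' table; the text
    body is the concatenation of the matching rows in 'notes'.
    """
    rows = conn.execute(
        """
        SELECT c.case_id, c.title, c.owner, GROUP_CONCAT(n.body, ' ')
        FROM cases AS c
        LEFT JOIN notes AS n ON n.case_id = c.case_id
        GROUP BY c.case_id, c.title, c.owner
        ORDER BY c.case_id
        """
    ).fetchall()
    return [
        {
            "id": case_id,
            "metadata": {"title": title, "owner": owner},
            "text": text or "",  # a case with no notes has an empty body
        }
        for case_id, title, owner, text in rows
    ]


# Illustrative data only; a real deployment would point at an existing database.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE cases (case_id INTEGER PRIMARY KEY, title TEXT, owner TEXT);
    CREATE TABLE notes (note_id INTEGER PRIMARY KEY, case_id INTEGER, body TEXT);
    INSERT INTO cases VALUES (1, 'Supplier dispute', 'Legal'), (2, 'Data request', 'Privacy');
    INSERT INTO notes VALUES (1, 1, 'Contract terms under review.'), (2, 1, 'Invoice withheld.');
    """
)
documents = build_documents(conn)
```

Each resulting dictionary carries the metadata fields and the combined text separately, which is the shape a classifier or indexer would typically need.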

conceptTaxonomyWorkflow

conceptTaxonomyWorkflow can perform an action on a document following a classification decision when certain criteria are met. The workflow source type can be FILE or HTTP. This product is available in non-SharePoint environments and has a plugin architecture that enables clients and integration partners to easily build plugins for both content sources and destinations.
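
The pattern described above, an action triggered by a classification decision and carried out by a pluggable destination, can be sketched as follows. This is an illustration under stated assumptions, not the conceptTaxonomyWorkflow plugin interface: the `Rule` and `Classification` types, the registry, and the plugin names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Classification:
    taxonomy: str
    node: str
    score: int


@dataclass
class Rule:
    taxonomy: str
    node: str
    min_score: int
    action: str  # name of a registered destination plugin (hypothetical)


# Registry mapping an action name to the function that performs it.
DESTINATIONS: Dict[str, Callable[[str], str]] = {}


def destination(name: str):
    """Decorator that registers a destination plugin under a name."""
    def register(func: Callable[[str], str]) -> Callable[[str], str]:
        DESTINATIONS[name] = func
        return func
    return register


@destination("quarantine")
def quarantine(source: str) -> str:
    return f"quarantined {source}"


@destination("move_to_records")
def move_to_records(source: str) -> str:
    return f"moved {source} to records store"


def run_workflow(source: str, classifications: List[Classification],
                 rules: List[Rule]) -> List[str]:
    """Run each rule's action once if any classification meets its criteria."""
    outcomes = []
    for rule in rules:
        matched = any(
            c.taxonomy == rule.taxonomy
            and c.node == rule.node
            and c.score >= rule.min_score
            for c in classifications
        )
        if matched:
            outcomes.append(DESTINATIONS[rule.action](source))
    return outcomes


rules = [
    Rule("Privacy", "PII", 70, "quarantine"),
    Rule("Records", "Contracts", 50, "move_to_records"),
]
outcomes = run_workflow(
    "file:///share/offer.docx",  # a FILE-type source; an HTTP URL would work the same way
    [Classification("Privacy", "PII", 80)],
    rules,
)
```

Keeping destinations in a registry means a new destination can be added without touching the rule-matching logic, which is the main appeal of a plugin design.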

conceptSearch

conceptSearch is a unique, language-independent enterprise search engine technology. conceptSearch is delivered as an out-of-the-box enterprise search application and offers a simple search interface and indexing facilities for internal content, web sites, file systems, and XML documents. It is typically used where accurate and relevant search results are mandatory.

conceptClassifier for OneDrive for Business

conceptClassifier for OneDrive for Business is an optional component that enables the full feature set of conceptClassifier for SharePoint and conceptClassifier for Office 365. For systems management, administrators can now apply policy across SharePoint, SharePoint Online/Office 365, and OneDrive for Business. For business users, OneDrive for Business provides the ability to retrieve documents from any location and regardless of the device they are using (currently with some exceptions). In addition, business users can share content, and OneDrive enables concurrent editing of a document while preserving its integrity.

conceptClassifier for OneDrive for Business provides governance, compliance, records management, and enterprise policy application, as well as collaboration and productivity enhancements. From an administrator perspective, the product provides management of all content regardless of where it resides.

Additional Classification Servers

Additional classification servers scale out classification and increase classification throughput, especially when on-the-fly classification is an important requirement.

Application Requirements

The technology supplied in the base conceptClassifier platform includes automatic semantic metadata generation, auto-classification, and taxonomy management. The key advantage of the software is that it can be deployed once and used multiple times to solve varied challenges. The key applications deployed by our clients are listed below.

Search Engine Integration

The conceptClassifier platform integrates with search engines and can perform on-the-fly classification when a search engine calls the classify API. Supported search engines include SharePoint, the former FAST products, Solr, Google Search Appliance, Autonomy, and IBM Vivisimo.

Required Products: conceptClassifier Platform
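
As a rough illustration of the on-the-fly flow in this section, the Python sketch below builds an XML classification request, parses an XML response, and attaches the resulting tags to a document before it is sent to a search index. The published API schema is not reproduced here, so every element name, attribute name, the sample response, and the `concept_tags` field are assumptions made for the example; the network call itself is omitted.

```python
import xml.etree.ElementTree as ET

# Hypothetical response payload; the real classify API schema may differ.
SAMPLE_RESPONSE = """<classifyResponse>
  <class taxonomy="Records" node="Contracts/Supplier" score="87"/>
  <class taxonomy="Privacy" node="PII/Financial" score="64"/>
  <class taxonomy="Records" node="Invoices" score="31"/>
</classifyResponse>"""


def build_classify_request(doc_id, text):
    """Serialize a document into a (hypothetical) XML classify request."""
    root = ET.Element("classifyRequest")
    ET.SubElement(root, "documentId").text = doc_id
    ET.SubElement(root, "text").text = text
    return ET.tostring(root, encoding="unicode")


def parse_classifications(response_xml, min_score=50):
    """Keep only classifications at or above a score threshold."""
    root = ET.fromstring(response_xml)
    results = []
    for element in root.findall("class"):
        score = int(element.get("score"))
        if score >= min_score:
            results.append(
                {
                    "taxonomy": element.get("taxonomy"),
                    "node": element.get("node"),
                    "score": score,
                }
            )
    return results


def enrich_index_document(index_doc, classifications):
    """Return a copy of the index document with taxonomy tags added."""
    enriched = dict(index_doc)
    enriched["concept_tags"] = [
        f'{c["taxonomy"]}:{c["node"]}' for c in classifications
    ]
    return enriched


request = build_classify_request("doc-1", "Supplier agreement with payment terms")
doc = enrich_index_document(
    {"id": "doc-1", "title": "Supplier agreement"},
    parse_classifications(SAMPLE_RESPONSE),
)
```

In a real integration, the request would be posted to the classification web service and the enriched document handed to the search engine's own indexing API; one document can carry tags from several taxonomies at once.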

Intelligent Document Classification

The conceptClassifier Platform classifies documents based on concepts and the multi-word terms that form a concept. Automatic and/or manual classification is included. Knowledge workers with the appropriate security can also classify content in real time. Content can be classified from diverse repositories, including File Shares, Exchange Public Folders, and websites. All content can be classified on the fly and classified to one or more taxonomies.

Required Product: conceptClassifier Platform