conceptClassifier for SharePoint

conceptClassifier for SharePoint makes sense of unstructured and semi-structured data, driving improved search, collaboration, compliance, and information governance

SharePoint / SharePoint Online Blogs Core Technology  White Paper SharePoint Webinar Request a Demo 

conceptClassifier for SharePoint

Concept Searching
OverviewFeaturesSpecsBenefitsDifferentiators

conceptClassifier for SharePoint generates compound term metadata, auto-classifies content to taxonomies, and eliminates end user tagging.

Available as an Add-in, conceptClassifier for SharePoint supports all versions of SharePoint on-premises. Real-time bidirectional population of term sets improves accuracy, reduces maintenance, and the costs of manual entry. Designed as a reusable enterprise framework that improves search, enforces compliance, delivers intelligent migration, provides proactive identification of security exposures, and secure collaboration.

enterprise-search-information-transparency

Would the ability to consistently retrieve relevant results in search be of value?

incorrect-metadata

Would you like to be able to tag and classify content accurately without human intervention?

policy

Are you concerned about data breaches and noncompliance regarding records management policies?

taxonomy

Looking for technologies that are easy to install, use, and manage, with little system overhead?

migration

Have you been postponing migration based on past experiences such as budget overruns, time, and costs?
Key Features

Key Features

The platform is an infrastructure component providing real-time read/write capabilities with the term store. Classification on upload, workflows, and content type updating are easily implemented. Developed as deploy once, utilize multiple times, it can be integrated with any SharePoint or .Net application.

Core Technology White Paper
  • Open architecture with all APIs based on XML and web services
  • Automatic generation of multi-term metadata, as well as keywords, entity extraction, acronyms (Unique)
  • Concept mapping, relationship identification, auto-clue suggestion (Unique), immediate feedback on taxonomy changes without re-indexing corpus (Unique)
  • Ability to generate and classify content from diverse internal and external repositories including those residing outside of SharePoint
  • Simultaneous native updating of the SharePoint Term Store (Unique)
  • Powerful, highly interactive taxonomy tools designed for the Subject Matter Expert (Unique)
  • Template architecture for rapid deployment, automatically maintains GUID’s across environments.
  • Provides transparent access to system internals including statistical profile of terms
  • Available as a Microsoft Add-in all environments
  • Eliminates manual tagging
  • Enables hybrid and concept based searching
  • Facilitates records management and eliminates noncompliance
  • Protects record integrity throughout the individual document lifecycle
  • Intelligently migrates content
  • Assists in the migration of content by identifying records as well as content that should have been archived, contains sensitive information, or should be deleted
  • Enables data cleansing for text analytics
  • Detects and automatically secures unknown privacy exposures
  • Enhances eDiscovery, litigation support, and FOIA
Key Benefits

Key Benefits

Concept-based searching requires no end user training, provides navigation by taxonomy hierarchy, and supports multiple search methods. In any project that uses metadata you will experience improved outcomes, reduced effort and time, and rapid deployment.

Technology Benefits White Paper

Features

conceptClassifier for SharePoint supports all versions of SharePoint in an on-premises environment.  The table below illustrates the functionality and features available in SharePoint.

For more information on a specific feature, please use the hover function over the specific Feature/Function.

  •  

    Taxonomy/Term Store Functionality

    Taxonomy Rollback
    Available in 55 languages, including Japanese, Korean, and Chinese.
    Text mining to identify candidate terms
    Relationship Definition
    Concept Mapping
    Calculations Feedback
    Managed Navigation
    Import, combine, organize and harmonize taxonomy models
    Polyhierarchy Support
    Boosting Ability
    Distributed taxonomy management
    Synonym support
    Instant feedback on taxonomy changes
    Controlled Vocabularies
    Ability to automatically suggest classification clues for term set terms
    Security
    Microsoft Platform Integration
    Manual tagging
    Automatic tagging and classification of content within and outside of SharePoint with ‘conceptual’ metadata
    Automatically classifies all SharePoint content, libraries, blogs, wikis, and threads lists
    Native Integration with the Term Store
    Indexer embedded within conceptClassifier and conceptTaxonomyManager
    Automatic generation of compound terms
    Integrated with Refinement Panel
    Managed by Subject Matter Experts
    Maintenance of term set GUIDs between environments
    Search integration
    Search improvements via the Refinement Panel via taxonomy integration
    Enables integration and concept based searching with any search engine

    Intelligent Metadata Enabled Solutions (Information Governance and Compliance)

    Records identification
    Data privacy and information security
    Intelligent migration
    Content Optimization
    Enterprise Content Management
    Text Mining and Analytics
    Research and Intelligence
    Compliance
    Secure Collaboration
    eDiscovery, FOIA, Litigation Support
    Knowledge Management and Expertise Location
    Forensic Analysis
    Mergers and Acquisitions
    Enterprise Metadata Repository

    SharePoint 2010

    Taxonomy/Term Store Functionality

    : -
    Taxonomy Rollback
    Available in 55 languages, including Japanese, Korean, and Chinese.
    Text mining to identify candidate terms
    Relationship Definition
    Concept Mapping
    Calculations Feedback
    Managed Navigation : Through the term set
    Import, combine, organize and harmonize taxonomy models
    Polyhierarchy Support : Partial
    Boosting Ability
    Distributed taxonomy management
    Synonym support
    Instant feedback on taxonomy changes
    Controlled Vocabularies : Can be loaded into the Term Store
    Ability to automatically suggest classification clues for term set terms
    Security
    Microsoft Platform Integration : N/A
    Manual tagging
    Automatic tagging and classification of content within and outside of SharePoint with ‘conceptual’ metadata
    Automatically classifies all SharePoint content, libraries, blogs, wikis, and threads lists
    Native Integration with the Term Store : N/A
    Indexer embedded within conceptClassifier and conceptTaxonomyManager
    Automatic generation of compound terms
    Integrated with Refinement Panel
    Managed by Subject Matter Experts
    Maintenance of term set GUIDs between environments
    Search integration
    Search improvements via the Refinement Panel via taxonomy integration
    Enables integration and concept based searching with any search engine

    Intelligent Metadata Enabled Solutions (Information Governance and Compliance)

    : -
    Records identification
    Data privacy and information security
    Intelligent migration
    Content Optimization
    Enterprise Content Management
    Text Mining and Analytics
    Research and Intelligence
    Compliance
    Secure Collaboration
    eDiscovery, FOIA, Litigation Support
    Knowledge Management and Expertise Location
    Forensic Analysis
    Mergers and Acquisitions
    Enterprise Metadata Repository

    SharePoint 2013/2016 Online and On-premises

    Taxonomy/Term Store Functionality

    : -
    Taxonomy Rollback
    Available in 55 languages, including Japanese, Korean, and Chinese.
    Text mining to identify candidate terms
    Relationship Definition
    Concept Mapping
    Calculations Feedback
    Managed Navigation : Through the term set
    Import, combine, organize and harmonize taxonomy models
    Polyhierarchy Support : Partial
    Boosting Ability
    Distributed taxonomy management
    Synonym support
    Instant feedback on taxonomy changes
    Controlled Vocabularies : Can be loaded into the Term Store
    Ability to automatically suggest classification clues for term set terms
    Security
    Microsoft Platform Integration : N/A
    Manual tagging
    Automatic tagging and classification of content within and outside of SharePoint with ‘conceptual’ metadata
    Automatically classifies all SharePoint content, libraries, blogs, wikis, and threads lists
    Native Integration with the Term Store : N/A
    Indexer embedded within conceptClassifier and conceptTaxonomyManager
    Automatic generation of compound terms
    Integrated with Refinement Panel
    Managed by Subject Matter Experts
    Maintenance of term set GUIDs between environments
    Search integration
    Search improvements via the Refinement Panel via taxonomy integration
    Enables integration and concept based searching with any search engine

    Intelligent Metadata Enabled Solutions (Information Governance and Compliance)

    : -
    Records identification
    Data privacy and information security
    Intelligent migration
    Content Optimization
    Enterprise Content Management
    Text Mining and Analytics
    Research and Intelligence
    Compliance
    Secure Collaboration
    eDiscovery, FOIA, Litigation Support
    Knowledge Management and Expertise Location
    Forensic Analysis
    Mergers and Acquisitions
    Enterprise Metadata Repository

    conceptClassifier for SharePoint

    Taxonomy/Term Store Functionality

    : -
    Taxonomy Rollback
    Available in 55 languages, including Japanese, Korean, and Chinese.
    Text mining to identify candidate terms
    Relationship Definition
    Concept Mapping
    Calculations Feedback
    Managed Navigation : Through the term set
    Import, combine, organize and harmonize taxonomy models
    Polyhierarchy Support
    Boosting Ability
    Distributed taxonomy management
    Synonym support
    Instant feedback on taxonomy changes
    Controlled Vocabularies : Can be loaded into the Term Store
    Ability to automatically suggest classification clues for term set terms
    Security
    Microsoft Platform Integration
    Manual tagging
    Automatic tagging and classification of content within and outside of SharePoint with ‘conceptual’ metadata
    Automatically classifies all SharePoint content, libraries, blogs, wikis, and threads lists
    Native Integration with the Term Store
    Indexer embedded within conceptClassifier and conceptTaxonomyManager
    Automatic generation of compound terms
    Integrated with Refinement Panel
    Managed by Subject Matter Experts
    Maintenance of term set GUIDs between environments
    Search integration
    Search improvements via the Refinement Panel via taxonomy integration
    Enables integration and concept based searching with any search engine

    Intelligent Metadata Enabled Solutions (Information Governance and Compliance)

    : -
    Records identification
    Data privacy and information security
    Intelligent migration
    Content Optimization
    Enterprise Content Management
    Text Mining and Analytics
    Research and Intelligence
    Compliance
    Secure Collaboration
    eDiscovery, FOIA, Litigation Support
    Knowledge Management and Expertise Location
    Forensic Analysis
    Mergers and Acquisitions
    Enterprise Metadata Repository

    Technology Specifications

    conceptClassifier for SharePoint is based on an open architecture with all APIs based on XML and web services. Transparent access to system internals, including the statistical profile of terms, is standard. conceptClassifier for SharePoint supports SharePoint 2007, 2010, and 2013. conceptClassifier for SharePoint Online  supports all versions of SharePoint Online, both multi- tenant and dedicated Office 365.  The base platform comprises the following components.

    Base Components in conceptClassifier for SharePoint

  • conceptClassifier for SharePoint

    Both automated and manual classification is supported to one or more term sets within the Term Store and across content hubs.

    conceptTaxonomyManager

    This is an advanced enterprise-class, easy-to-use taxonomy, term set development, and management tool. It integrates natively with the SharePoint Term Store, reading and writing in real time, ensuring the taxonomy/term set definition is maintained in only one place, the SharePoint Term Store. Designed for use by subject-matter-experts, the Term Store and/or taxonomy is easily developed, tested, and refined.

    Term set migration tools are also a component of conceptTaxonomyManager and enable term sets to be developed on one server, for example, an on-premises server, and then migrated to another server, for example, an Office 365 server, in an incremental fashion and preserving all GUIDs. This is a key requirement in migration.

    Compound Term Processing Engine

    Licensed for the sole use of building and refining the taxonomy/term set, the compound term processing engine provides automatic semantic metadata generation that extracts multi-word terms or concepts along with keywords and acronyms.

    SharePoint Feature Set

    Provides SharePoint integration and an additional multi-value picklist browse taxonomy control, enabling users to combine free text and taxonomy browse searching.

    Benefits

  • Where does conceptClassifier for SharePoint fill the gaps?

     

    SharePoint has no ability to automatically update the content type for records management or privacy protection and route to the appropriate repository

    SharePoint has no taxonomy management tools to manage, test, and validate taxonomies based on the Term Store

    SharePoint has no auto-classification capabilities

    SharePoint has no ability to generate semantic metadata and surface it to search engines to improve search results

    SharePoint has no ability to automatically create and store classification metadata

    SharePoint has no ability to automatically tag content with vocabulary or retention codes for records management

    SharePoint has no ability to provide intelligent migration capabilities based on the semantic metadata within content, identify previously undeclared documents of record, unidentified privacy exposures, or information that should be archived or deleted

    SharePoint has no ability to provide granular and structured identification of people, content recommendations, and organizational knowledge assets

     

  • Leveraging Your SharePoint Investment

    When evaluating a technology purchase and the on-going investment required to deploy, customize, and maintain, the costs can scale quickly. Because conceptClassifier for SharePoint is an enterprise infrastructure component, you can leverage your investment through:

    • Native real-time read/write with the term store
    • Ability to implement workflow and automatic content type updating
    • Reduce IT Staff requirements to support diverse applications
    • Reduce costs associated with the purchase of multiple, stand-alone applications
    • Deploy once, utilize multiple times
    • Rapidly integrated with any SharePoint or any .Net application
    • Used by Subject Matter Experts, not IT staff, does not require outside resources to manage and maintain
    • Eliminate unproductive and manual end user tagging and the support required by business units and IT
    • Reduce hardware expansion costs due to scalability and performance features
    • Deployable as an on-premise, cloud, or hybrid solution

    Leveraging Your Business Investment

    The real value of your investment includes both technology and the demonstrable ROI that can be generated from improving business processes. conceptClassifier for SharePoint has been deployed by clients to solve individual or multiple challenges including:

    • Enables concept based searching regardless of search engine
    • Reduces organizational costs associated with data exposures, remediation, litigation, fines and sanctions
    • Eliminates manual metadata tagging and human inconsistencies that prohibit accurate metadata generation
    • Prevents the portability and electronic transmission of secured assets
    • Assists in the migration of content by identifying records as well as content that should have been archived, contains sensitive information, or should be deleted
    • Protects record integrity throughout the individual document lifecycle
    • Creates virtual centralization through the ability to link disparate on-premise and off-premise content repositories
    • Ensures compliance with industry and government mandates enabling rapid implementation to address regulatory changes
  • Concept Searching has a current Enterprise Authority to Operate (ATO) US Air Force, a current Enterprise Certificate of Networthiness (CoN) US Army, and has been deployed on the SIPR, NIPR, and DISA networks. 

    Technology and Business Differentiators

  • Compound Term Processing


    Concept Searching’s industry unique compound term processing technology delivers outcomes that are not achieved by any other classification engine. Compound term processing means that Concept Searching’s statistical engine can understand, out-of-the-box, the incremental value of keywords, multi-word fragments, and compound terms. As a result, it can identify concepts resident within an organization’s own information repositories that are highly correlated to particular topics. With the identification of these highly correlated topics in the form of keywords, multi-word fragments and compound terms the result is automatically generated intelligent metadata that is unique to the organization. By using these compound terms in any application that requires metadata, the outcomes are highly accurate, because the ambiguity inherent in single words is no longer an issue.

    A search for ‘triple heart bypass’ will locate documents about this topic even if this precise phrase is not contained in any document. A concept search using compound term processing can extract the key concepts, in this case “triple heart bypass” and use these concepts to retrieve relevant documents containing concepts such as ‘heart surgery’, ‘coronary artery bypass’, or ‘open heart surgery’.

  • Industry Unique Taxonomy Management Features

    conceptTaxonomyManager remains unique in the industry in features that provide the ability to rapidly and easily change the taxonomy as the organizational needs and requirements change. This is important as a taxonomy must remain fluid as opposed to static and must be managed in a way that facilitates change. The easy to use taxonomy and automatic classification tools create the framework to classify content based on concepts to one or more nodes in the taxonomy or multiple taxonomies. The conceptTaxonomyManager component is included in the base product in all platforms.

  • Ease of Use

    conceptTaxonomyManager is a simple yet powerful tool with an intuitive user interface designed for Subject Matter Experts (SME) without the need for IT, Information Scientists, or specific application skills, to build, maintain and validate taxonomies for the enterprise. This feature has been shown to reduce taxonomy development by up to 80% (client source data).

    Automatic Clue Suggestion

    Eliminating complex Boolean rules and the need for training sets the taxonomy nodes can be automatically generated from the compound terms found in the document corpus. The Subject Matter Expert (SME) has full control of the terms to be used as well as the weighting of the term based on its relevancy. This enables a much more robust taxonomy as the terms are suggested based on the organization’s own content and can offer the SME new terms from the relevant documents that may not have been identified. The Clues can also be assigned a score or weight, either positive or negative to improve the classification. Clues can also be assigned a Type. Types include standard, case-sensitive, metadata, phonetic, and RegEx (Regular Expression).

    Document Movement Feedback

    Automatic document movement feedback enables the SME to see the cause and effect on changing the clue weightings for a node in the taxonomy. The user can also search within the refined node and bring back documents from the whole corpus now classified against the node. The system will indicate if the change has increased the score, reduced the score as well as identify documents that will no longer be classified and the new documents that will be classified.

    Distributed Taxonomy Management

    This feature is a requirement for organizations that have many taxonomy operators, extremely large collections of documents, and where taxonomy management is a critical business process. This feature can be implemented on any number of servers and several taxonomy managers can be assigned to a server to ensure the level of throughput needed. Real time locking mechanisms are used to make nodes of the taxonomy inaccessible to other taxonomy managers while the node is being edited. The taxonomy managers can visually see when a node is locked and who has locked it as well as when it becomes available. The Distributed Taxonomy Management feature is totally transparent to the end user and all locking and unlocking of the nodes by the taxonomy managers are coordinated by the central server.

    Security and Rollback

    The product provides a full security model enabling lock down of nodes, branches, and complete taxonomies to particular users and/or groups of users. Also supports rollback to the previous state.

    conceptTaxonomyWorkflow

    conceptTaxonomyWorkflow is a powerful add-on product to automate manual business processes. It serves as a strategic tool managing migration activities and content type application across multiple SharePoint and non-SharePoint farms and is also platform agnostic. This add-on component delivers value specifically in migration, data privacy, records management or any application or business process that requires workflow capabilities. It is required to apply an action on a document and optionally, automatically apply a content type and route to the appropriate repository for disposition.

  • Intelligent Metadata Enabled Solutions

    Concept Searching is the only available solution that addresses the challenges in managing unstructured and semi-structured data in SharePoint and non SharePoint environments.

    Our intelligent metadata enabled solutions address the following challenges with one set of technologies, leverages an enterprises investment in SharePoint, and reduces resources to maintain and manage the solution.

  •  

    Enterprise Search enables concept based searching by providing the search engine index with the compound terms and semantic metadata.

    Records Management is improved through the elimination of end user tagging, automatic declaration of records, and taxonomy workflow capabilities.

    Data Privacy and the protection of confidential data as defined by the organization is identified in real-time and routed to a secure repository for disposition.

    Intelligent Migration is accomplished through the identification and auto-classification of the unstructured or semi-structured data to one or more taxonomies.

    Text Analytics can be performed to extract organizationally defined descriptors and concepts from diverse repositories.

    Secure Collaboration ensures only information that is appropriate is shared with staff, partners, and stakeholders.

    eDiscovery and FOIA time and costs are reduced through identification of highly detailed information and conceptually similar information that typically would not be found.