Updated Pricing Plans for the Self-Service Semantic Suite

The Self-Service Semantic Suite

The Self-Service Semantic Suite (S4) by Ontotext provides a platform for on-demand Smart Data management. With S4 developers get instant access to services for text analytics, knowledge graphs and semantic graph database-as-a-service in the Cloud. The goal of S4 is to increase the speed and reduce the cost of building Smart Data analytics prototypes.

The capabilities that the S4 platform provides are:

  • text analytics of unstructured content that helps you extract valuable insights from your content
    • News analytics – information extraction, disambiguation & entity linking to concepts and instances from knowledge graphs (DBpedia, Wikidata & GeoNames ). The News Analytics service is also extended with image analytics capabilities, based on the Imagga Image Tagging service, so that both the text content and images of web pages can be analyzed
    • News classifier – categorization of news articles according to the 17 top-level categories of the IPTC Subject Reference System
    • Biomedical analytics – recognition of 130+ biomedical entity types and linking them to the LinkedLifeData large-scale biomedical knowledge base
    • Twitter analytics – named entity recognition of various entities found in tweets (based on the TwitIE microblog pipeline by GATE)
  • Semantic graph database-as-a-service that helps you add, interlink and query semantic facts loaded from open knowledge graphs or discovered in your own text content
    • A self-managed version of the Ontotext GraphDB triplestore (RDF database) instantly available on the AWS Marketplace with a pay-per-use pricing
    • A fully managed database-as-a-service (DBaaS) version of GraphDB, where database instances are instantly available to developers and the S4 platform takes care of DBA aspects such as configuration, operation, maintenance and scaling
  • Access to large open knowledge graphs that enhance the semantic analysis process. The knowledge graph capability is based on the FactForge semantic data warehouse by Ontotext, integrating and aligning different open knowledge graphs such as DBpedia, Wikidata and GeoNames


Smart Data management with the Self-Service Semantic Suite (S4)

The capabilities of the Self-Service Semantic Suite are based on enterprise-grade technology by Ontotext successfully applied within use cases in verticals such as media & publishing, healthcare & life sciences, compliance, cultural heritage and digital libraries.

Transparent Pay-per-Use Pricing

As of January 1st, all S4 services are available with a transparent, pay-per-use pricing based on the volume of text content processed and the volume of RDF data stored in the semantic graph database. Startups and SMBs can now experiment more easily with smart data prototypes, since there’s no need for complex resource budgeting and software license commitments

Transparent pricing

S4 makes it easy for companies to start with small prototypes, utilizing the free tier of S4 services, or small volume text processing and semantic graph databases, and only increase the service usage if their prototypes successfully get traction – pay-as-you-grow is easy with the Self-Service Semantic Suite.

Text Analytics Services

The text analytics services available on S4 are available with a pay-per-use pricing, per gigabyte of input data processed (plain text, HTML, or PDF / MS Office formats)

service price
First 250 MB of data processed FREE
News analytics $99 / GB
News classifier $99 / GB
Biomedical analytics $99 / GB
Twitter analytics $29 / GB

For every developer account S4 provides a large free tier of 250 MB of data processed. Depending on the type of data being processed, the free tier should be sufficient to process the following number of documents:

  • More than 1,000,000 million tweets (200 bytes average size)
  • 100,000 text snippets of 2.5KB each
  • 25,000 web pages of average size 10KB

Self-Managed Semantic Graph Database

The self-managed RDF database is based on the Standard or Free editions of Ontotext’s GraphDB database. The self-managed database is available via the AWS Marketplace and can be deployed with just a few clicks.

The pricing is based on the database instance size (in terms of EC2 instances) as follows:

EC2 instance type Virtual cores RAM (GB) EC2 cost


Software price


Data volume estimate*


GraphDB Free GraphDB Standard
T2.small 1 2 0.026 FREE 50 million
T2.medium 2 4 0.052 FREE 200 million
T2.large / M4.large 2 8 0.126 FREE 0.35 500 million
R3.large 2 15 0.175 FREE 0.40 1 billion
R3.xlarge 4 30.5 0.35 FREE 0.75 2 billion
R3.2xlarge 8 61 0.70 FREE 1.40 4 billion

* Note that the optimum server configuration for your data volume will depend on several factors, such as: the nature of the data (how connected is the graph, what is the ratio of explicit to implicit statements, etc.), the query volume and the complexity of the queries. The provided estimates should be used only as a starting point and we recommend that additional benchmarking is performed with data & queries specific to the use case, so that the optimum server configuration can be selected.

The costs for the network attached SSD storage (EBS volumes, charged at $0.10 per GB per month) also depends on the database size: as an average metric, 1 million triples will require at least 100 MB of storage space. Also note that the users are charged for the software purchased via the AWS Marketplace only when the AMI is running – if the instance is stopped or terminated, there are no ongoing charges, except for the EBS storage used by the database at rest.

Fully Managed Semantic Graph Database

The fully managed database-as-a-service provides a version of the GraphDB semantic graph database where developers do not need to configure, operate and maintain the database instances. All DBA related tasks are taken care of by the S4 platform on behalf of the developers. Several DBaaS options are available, with monthly subscription pricing based on database size:

DBaaS type Data volume




XS 10 million FREE
S 50 million $49
M 250 million $179
L 1 billion $449

The fully managed databases are instantly available to developers, and accessible via simple RESTful API based on the OpenRDF API standard for RDF data management.

Next Steps

S4 provides capabilities for Smart Data prototyping, based on enterprise Semantic Technology by Ontotext. Register for a developer account, take a look at the documentation and sample code in various programming languages, and start experimenting with text analytics, semantic graph databases and open knowledge graphs!

Marin Dimitrov

Marin Dimitrov

CTO at Ontotext
As the technological captain of Ontotext, he is leading the company on the right tech route and reserving our spot on the map of the world. His sharp mind can explain complex things in a simple way, making him an invaluable resource in semantics. Marin is a frequent speaker on semantic conferences and open data meetups at various technology related events.
Marin Dimitrov

Related Posts

  • Featured Image

    Weaving Data Into Texts: The Value of Semantic Annotation

    Semantic annotation is about weaving data into textual sources. In semantically annotated texts, certain words (denoting things, people, locations, organizations, etc) are linked to data – that is, to context and references that can be processed by an algorithm.

  • Featured Image

    The New Cache on the Block: A Caching Strategy in GraphDB To Better Utilize Memory

    The ability to seamlessly integrate datasets and the speed at which this can be done are mission critical when it comes to working with big data. The new caching system of GraphDB is better, faster and smarter and solves the issues of the old caching strategy in GraphDB.

  • Featured Image

    Fighting Fake News: Ontotext’s Role in EU-Funded Pheme Project

    Before ‘fake news’ became the latest buzzword, in January 2014 Ontotext started working on Project PHEME – ‘Computing Veracity Across Media, Languages, and Social Networks’ alongside eight other partners. The EU-funded project aimed at creating a computational framework for automatic discovery and verification of information at scale and fast.

Back to top