Updated Pricing Plans for the Self-Service Semantic Suite

The Self-Service Semantic Suite

The Self-Service Semantic Suite (S4) by Ontotext provides a platform for on-demand Smart Data management. With S4 developers get instant access to services for text analytics, knowledge graphs and semantic graph database-as-a-service in the Cloud. The goal of S4 is to increase the speed and reduce the cost of building Smart Data analytics prototypes.

The capabilities that the S4 platform provides are:

  • text analytics of unstructured content that help you extract valuable insights from your content
    • News analytics – information extraction, disambiguation & entity linking to concepts and instances from knowledge graphs (DBpedia, Wikidata & GeoNames ). The News Analytics service is also extended with image analytics capabilities, based on the Imagga Image Tagging service, so that both the text content and images of web pages can be analysed
    • News classifier – categorisation of news articles according to the 17 top-level categories of the IPTC Subject Reference System
    • Biomedical analytics – recognition of 130+ biomedical entity types and linking them to the LinkedLifeData large-scale biomedical knowledge base
    • Twitter analytics – named entity recognition of various entities found in tweets (based on the TwitIE microblog pipeline by GATE)
  • Semantic graph database-as-a-service that helps you add, interlink and query semantic facts loaded from open knowledge graphs or discovered in your own text content
    • A self-managed version of the Ontotext GraphDB triplestore (RDF database) instantly available on the AWS Marketplace with a pay-per-use pricing
    • A fully managed database-as-a-service (DBaaS) version of GraphDB, where database instances are instantly available to developers and the S4 platform takes care of DBA aspects such as configuration, operation, maintenance and scaling
  • Access to large open knowledge graphs that enhance the semantic analysis process. The knowledge graph capability is based on the FactForge semantic data warehouse by Ontotext, integrating and aligning different open knowledge graphs such as DBpedia, Wikidata and GeoNames

 

Smart Data management with the Self-Service Semantic Suite (S4)

The capabilities of the Self-Service Semantic Suite are based on enterprise-grade technology by Ontotext successfully applied within use cases in verticals such as media & publishing, healthcare & life sciences, compliance, cultural heritage and digital libraries.

Transparent Pay-per-Use Pricing

As of January 1st, all S4 services are available with a transparent, pay-per-use pricing based on the volume of text content processed and the volume of RDF data stored in the semantic graph database. Startups and SMBs can now experiment more easily with smart data prototypes, since there’s no need for complex resource budgeting and software license commitments

Transparent pricing

S4 makes it easy for companies to start with small prototypes, utilising the free tier of S4 services, or small volume text processing and semantic graph databases, and only increase the service usage if their prototypes successfully get traction – pay-as-you-grow is easy with the Self-Service Semantic Suite.

Text Analytics Services

The text analytics services available on S4 are available with a pay-per-use pricing, per gigabyte of input data processed (plain text, HTML, or PDF / MS Office formats)

service price
First 250 MB of data processed FREE
News analytics $99 / GB
News classifier $99 / GB
Biomedical analytics $99 / GB
Twitter analytics $29 / GB

For every developer account S4 provides a large free tier of 250 MB of data processed. Depending on the type of data being processed, the free tier should be sufficient to process the following number of documents:

  • More than 1,000,000 million tweets (200 bytes average size)
  • 100,000 text snippets of 2.5KB each
  • 25,000 web pages of average size 10KB

Self-Managed Semantic Graph Database

The self-managed RDF database is based on the Standard or Free editions of Ontotext’s GraphDB database. The self-managed database is available via the AWS Marketplace and can be deployed with just a few clicks.

The pricing is based on the database instance size (in terms of EC2 instances) as follows:

EC2 instance type Virtual cores RAM (GB) EC2 cost

($/hour)

Software price

($/hour)

Data volume estimate*

(triples)

GraphDB Free GraphDB Standard
T2.small 1 2 0.026 FREE 50 million
T2.medium 2 4 0.052 FREE 200 million
T2.large / M4.large 2 8 0.126 FREE 0.35 500 million
R3.large 2 15 0.175 FREE 0.40 1 billion
R3.xlarge 4 30.5 0.35 FREE 0.75 2 billion
R3.2xlarge 8 61 0.70 FREE 1.40 4 billion

* Note that the optimum server configuration for your data volume will depend on several factors, such as: the nature of the data (how connected is the graph, what is the ratio of explicit to implicit statements, etc.), the query volume and the complexity of the queries. The provided estimates should be used only as a starting point and we recommend that additional benchmarking is performed with data & queries specific to the use case, so that the optimum server configuration can be selected.

The costs for the network attached SSD storage (EBS volumes, charged at $0.10 per GB per month) also depends on the database size: as an average metric, 1 million triples will require at least 100 MB of storage space. Also note that the users are charged for the software purchased via the AWS Marketplace only when the AMI is running – if the instance is stopped or terminated, there are no ongoing charges, except for the EBS storage used by the database at rest.

Fully Managed Semantic Graph Database

The fully managed database-as-a-service provides a version of the GraphDB semantic graph database where developers do not need to configure, operate and maintain the database instances. All DBA related tasks are taken care of by the S4 platform on behalf of the developers. Several DBaaS options are available, with monthly subscription pricing based on database size:

DBaaS type Data volume

(triples)

Price

($/month)

XS 10 million FREE
S 50 million $49
M 250 million $179
L 1 billion $449

The fully managed databases are instantly available to developers, and accessible via simple RESTful API based on the OpenRDF API standard for RDF data management.

Next Steps

S4 provides capabilities for Smart Data prototyping, based on enterprise Semantic Technology by Ontotext. Register for a developer account, take a look at the documentation and sample code in various programming languages, and start experimenting with text analytics, semantic graph databases and open knowledge graphs!

Marin Dimitrov

Marin Dimitrov

CTO at Ontotext
As the technological captain of Ontotext, he is leading the company on the right tech route and reserving our spot on the map of the world. His sharp mind can explain complex things in a simple way, making him an invaluable resource in semantics. Marin is a frequent speaker on semantic conferences and open data meetups at various technology related events.
Marin Dimitrov

Related Posts

  • Open data fosters a culture of creativity and innovation

    Open Data Innovation? Open Your Data And See It Happen.

    As more and more companies and startups are creating business and social value out of open data, the open data trend-setting governments and local authorities are not sitting idle and are opening up data sets and actively encouraging citizens, developers, and firms to innovate with open data.

  • Linked Open Data Sets

    Linked Data Innovation – A Key To Foster Business Growth

      ‘Data is the new oil’, once said Neelie Kroes,  former Vice-President of the European Commission responsible for the Digital Agenda, aptly describing how the growing amounts of data are changing businesses and our lives. The year…

  • featured image

    Linked Open Data for Cultural Heritage and Digital Humanities

    The Galleries, Libraries, Archives and Museums (GLAM) sector deals with complex and varied data. Integrating that data, especially across institutions, has always been a challenge. On the other hand, the value of linked data is especially high in this sector, since culture by its very nature is cross-border and interlinked.

Back to top