Posted on

June 24: From BI to AI (Anaconda, Ataccama, Databricks, Dataiku, DataRobot, Domino Data Lab, Precisely, Prophecy, PythonAnywhere, Starburst, Varada)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Bain Capital Invests $150M into Data Management Platform Ataccama

On June 22, data management platform provider Ataccama announced that they had received $150M in growth capital from Bain Capital Tech Opportunities as a minority investment. The funds will go towards sales and marketing, R+D around new product innovation, and global expansion. Ataccama has only taken funding once before in the form of a $500K seed round in 2010 when it spun off from big data company Adastra; this much more significant investment indicates a desire to grow more quickly to take on the IPaaS competition like Informatica and Talend.

Launches and Updates

At Everyday AI, Dataiku Debuts Dataiku 11

At their Everyday AI conference this week in London, data science and AI platform Dataiku launched Dataiku 11. Key features of this major release include optimized tooling for advanced users, an integrated data labeling framework for inline image annotation, a visual interface for computer vision tasks allowing data scientists at all levels to work on models for complex object detection and image classification, and expanded capabilities around Responsible AI and AI governance. Dataiku 11 also includes tools for non-coding team members such as a no-code visual time series forecasting capability, a centralized feature store and workflows for more easily sharing and reusing existing work, and “what-if” accelerators to evaluate potential business outcomes in a codeless way.

Domino Data Lab Announces Nexus, a Hybrid MLOps Architecture

First previewed at the Rev 3 conference last month, Domino officially launched its Nexus hybrid MLOps architecture this week. Customers using Nexus will be able to use owned on-prem NVIDIA GPUs for cost optimization, while also having the ability to scale workloads to include cloud-based GPUs when they don’t have enough capacity on-prem. NVIDIA is a launch partner of NEXUS, and Domino has joined the NVIDIA AI Accelerated program as part of their ongoing partnership around building, managing, and deploying GPU-trained models.

Precisely Launches Data Integrity Suite

Data integrity platform Precisely announced the Precisely Data Integrity Suite, a collection of SaaS modules that can be deployed individually or in concert to provide businesses with trustable data. The Data Integration, Data Observability, and Data Governance modules are now available for early access, while modules for Data Quality, Geo Addressing, Spacial Analytics, and Data Enrichment are forthcoming.

Prophecy Launches Low-Code “Prophecy for Databricks”

Low-code data engineering platform Prophecy launched Prophecy for Databricks this week. Prophecy for Databricks is a drag-and-drop interface to create and launch data pipelines on Spark, empowering data analysts to become “citizen data engineers.” The visual interface generates PySpark or Scala code to create these pipelines, then uses standard Databricks Workflows to manage the pipelines in production. Databricks users can access Prophecy for Databricks through Databricks Partner Connect.

Acquisitions

Anaconda Acquires Cloud-Based Development Environment PythonAnywhere

Earlier this week, Anaconda acquired cloud-based Python development and hosting platform PythonAnywhere. Anaconda users will now be able to use Python in a cloud environment, accessing the PythonAnywhere development environment from any web browser and allowing for better team collaboration and asset sharing.

Starburst Acquires Data Lake Analytics Accelerator Varada

Analytics company Starburst announced this week that they had acquired Varada, a data lake analytics accelerator. Varada’s proprietary indexing technology drew Starburst’s interest in hopes of advancing the performance and cost efficiency of their existing query engine. The rollout is expected to be quick; Starburst is expecting to roll Varada’s capabilities out to select customers by the end of July, with general availability in the fall of 2022.

Hiring

Chris Riley Joins DataRobot as President of Worldwide Field Operations

On June 21, DataRobot announced that they had appointed Chris Riley as the President of Worldwide Field Operations. Riley comes to DataRobot from Automation Anywhere, a robotic process automation company, where he served as the Chief Revenue Officer. Prior to that, Riley spent time at Dell as the President of Dell Technologies Select, and as President of the Americas for Dell Technologies.

Events

Databricks Data + AI Summit 2022, June 27-30

The 2022 Databricks Data + AI Summit will be held in-person in San Francisco and virtually, June 27-30, with the theme “Destination Lakehouse” to focus on how the modern data stack functions to turn data into actions more quickly. Key speakers include Databricks co-founders Ali Ghodsi, Matei Zaharia, and Reynold Xin; Google Brain and Coursera co-founder Andrew Ng; Christopher Manning, director of the Stanford AI Lab; Insitro founder and CEO Daphne Koller; Hidden Door co-founder and CEO Hilary Mason; AI pioneer Peter Norvig; Girls Who Code CEO Tarika Barrett; and Thoughtworks director Zhamak Dehghani. The conference is sold out in person, but to attend virtually, register at Data + AI Summit.

Posted on

June 17: From BI to AI (Anaconda, Domino Data Lab, H2O.ai, Informatica, KNIME, Matillion, Okera, Snowflake, Yellowbrick)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Snowflakes in June

The biggest news: Snowflake Summit 2022 was this week, and a wide variety of data companies released announcements in conjunction with the conference, whether technical or fiscal in nature.

Snowflake Releases Unistore, A Workload Combining Transactional and Analytical Data in One Platform

Snowflake itself had several major announcements at Snowflake Summit 2022. The first covered the debut of Unistore, a workload that will allow Snowflake users to store transactional and analytical data together in a Snowflake data warehouse. Snowflake’s new Hybrid Tables will enable this new approach; customers will be able to perform fast analytics on transactional data stored in Snowflake for more timely understanding, and build transactional apps atop Snowflake.

Snowflake Introduces Native Application Framework

Snowflake also announced a Native Application Framework. Developers will be able to build data applications on Snowflake and monetize them on the Snowflake Marketplace, allowing Snowflake consumers to install and run those applications securely in their own Snowflake instances without needing to move or share data. In conjunction with this, Informatica launched their new enterprise data integrator on Snowflake, reflecting an expanded partnership with Snowflake.

The Register interviewed Amalgam Insights’ Hyoun Park on Snowflake’s Announcements, covering Unistore and the Snowflake Native Application Framework.

Snowflake Expands Native Python Support and Data Access with Snowpark for Python

Finally, Snowflake announced a number of changes demonstrating stronger Python support for machine learning and application development on Snowflake. First, Snowflake launched Snowpark for Python into public preview, broadening from existing Scala and Java support. This means that Python’s open-source packages and libraries are now accessible within Snowpark, providing a strong foundation for the most popular language for building machine learning models. Additional support for Python developers includes a new Streamlit integration for easier app development on Snowflake; Snowflake Worksheets for Python to enable development of machine learning models, pipelines, and applications directly in Snowsight; large memory warehouses to support memory-intensive operations like feature engineering and model training on large datasets, enabled through Snowflake’s Anaconda integration; and SQL Machine Learning, allowing data analysts to more easily use machine learning algorithms without requiring advanced knowledge. The first algorithm available is time-series forecasting. Finally, Snowflake also increased data access with better support for ingesting and transforming streaming data, and working with data external to the Snowflake Data Cloud, even on-prem data, while still conferring some of the advantages of storing data in Snowflake.

Funding

Domino Data Lab Reveals Investment from Snowflake Ventures

Domino Data Lab announced an investment from Snowflake Ventures this week for an undisclosed amount, following up on Snowflake Ventures’ previously unannounced participation in Domino’s Series F funding round last October. The additional investment demonstrates the strength of the Snowflake-Domino partnership being robust enough for Snowflake to take an equity stake in Domino, rather than being solely a technical partnership.

Matillion Announces Snowflake Ventures Investment

Data integration platform Matillion also announced an investment from Snowflake Ventures. As part of this ongoing “investipartnership,” Matillion will be among the first Snowflake data integration partners to use the just-announced Snowflake Native Application Framework by making Matillion connectors available directly within Snowflake.

Launches and Updates

KNIME Software Release: Improved Python, Snowflake Integrations

KNIME announced the latest release of their data science platform. Key new features include upgrades to KNIME’s Python support with a built-in Python environment and the ability to write KNIME extensions entirely in Python, as well as a Snowflake integration that allows users to build machine learning models in H2O.ai, and then push the model down to Snowflake for predictions.

Okera Now Generally Available on Snowflake

Data security and governance company Okera announced that Okera was now available on Snowflake as a SaaS offering for Snowflake Data Cloud. Okera’s universal data authorizaton policies are automatically translated into Snowflake data access governance controls, allowing native data security policy enforcement within Snowflake.

Yellowbrick Launches Latest Version of its Data Warehouse

Cloud data warehouse Yellowbrick released a new version of its platform this week. Key features include on-prem and AWS deployment options (Azure and Google Cloud Platform coming in Q3), data lake integration using Parquet, separation of compute and storage for more elastic scaling on demand, and multiple payment models (consumable either on-demand or through a subscription based on fixed capacity). Yellowbrick also announced two new partnerships with Saarthee and Saxon, two data and analytics companies.

Partnerships

H2O.ai Expands Snowflake Partnership

H2O.ai continues to grow its Snowflake partnership. Users are able to use H2O.ai machine learning capabilities on the data within their Snowflake environment; H2O.ai is expanding support for financial services, manufacturing, and healthcare customers doing machine learning.

Posted on

June 10: From BI to AI (Amazon SageMaker, Databricks, Dataiku, DataRobot, Expert.ai, Google Cloud, Immuta, Informatica, KNIME, Labelbox, Matillion, Neo4j, NVIDIA, Qlik, RapidMiner, Snowflake, Teradata, TIBCO)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Immuta Raises $100 Million Series E Round

On June 8, secure data access platform Immuta announced that it had raised $100M in Series E funding. NightDragon led the round, with participation from new investor Snowflake Ventures, and prior investors Dell Technologies Capital, DFJ Growth, IAG, Intel Capital, March Capital, StepStone, Ten Eleven Ventures, and Wipro Ventures. Immuta will use the funds for additional hiring in sales, marketing, and customer success, as well as continued R+D and building out strategic partnerships with other vendors in the cloud data space.

Matillion Reveals Strategic Investment from Citi Ventures

Enterprise cloud data integration platform Matillion announced a strategic investment from Citi Ventures this week for an undisclosed amount. Matillion’s last publicly shared valuation was $1.5B, after their series E round last September for $150M.

Launches, Updates, and Partnerships

Databricks Delivers Data Lineage For Unity Catalog

Databricks announced that data lineage for Unity Catalog is now available in preview on AWS and Microsoft Azure. The data lineage feature will let customers understand the history of any data in their lakehouse – where it came from, when was it created, who created it, how has it been modified from the original raw data import, and how it’s being used, among other features. Because this is done automatically, the results save time and provide better accuracy compared to manually tagging data with the relevant metadata, and allow organizations to better meet compliance standards and relevant regulations.

Dataiku Arrives on Azure

Dataiku announced a partnership with Microsoft Azure this week, launching the Dataiku cloud AI platform in the Azure cloud. Dataiku’s new cloud stack accelerator capability allows for automated deployment, configuration, and management of Dataiku’s Everyday AI platform on Azure with a template-based approach.

DataRobot Debuts AI Cloud Improvements at DataRobot AIX 2022

At DataRobot AIX 2022, DataRobot announced a number of improvements to their AI Cloud product. Notable enhancements include code-first notebooks integrated into AI Cloud, bringing capabilities from the recent Zepl acquisition into DataRobot’s offerings and augmenting support for code-centric data scientists; expanded enterprise-level MLOps capabilities for the full model lifecycle, including integrations with GitHub, SumoLogic, Splunk, Datadog, and Zendesk; bias mitigation that automatically identifies and adapts machine learning models exhibiting detectable bias prior to deployment; and automated compliance documentation, even for models built outside of DataRobot. DataRobot also broadened their partnership with Google Cloud, launching AI Cloud in the Google Cloud Marketplace.

Expert.ai Imports Its Natural Language Capabilities to Qlik

Expert.ai announced this week that it has joined the Qlik Technology Partner Program. Qlik users will be able to use expert.ai language intelligence within Qlik Cloud, including natural language capabilities such as sentiment analysis, document categorization, and text disambiguation.

New Features and Partnerships for Google Cloud Vertex AI

At this week’s Google Cloud Applied ML Summit, Google revealed numerous new features and partnerships for their applied machine learning product, Vertex AI. Google’s existing NVIDIA partnership yielded one-click deploy of NVIDIA AI solutions to Vertex AI Workbench, as well as the new Vertex AI Training Reduction Server, which optimizes multi-node distributed training on NVIDIA GPUs, reducing training time for large language models like BERT. Google also announced a new data partnership with Neo4j, allowing data scientists to work with data and build models in Neo4j Graph Data Science, then deploy the models using Vertex AI. One more partnership with Labelbox provided yet another integration, reducing the time required to label unstructured data and speed up the model development process. Finally, Google also announced the preview of several standalone features: Vertex AI Tabular Workflows, allowing users to choose which parts of the model building and deployment processes they want to use AutoML for while being more hands-on with other parts; Serverless Spark for Vertex AI Workbench for data scientists to launch a server less spark session within a notebook; and Vertex AI Example-Based Explanations, which helps data scientists diagnose issues in their models using explainable AI techniques.

Informatica Updates Global Partner Program with Three Initiatives

Informatica revealed enhancements for its Global Channel Partner Program this week to boost partnered sales and support efforts for cloud modernization with joint customers. The new initiatives include incentives to source bookings for Gold and Platinum-level partners; sales, delivery, and technical certifications to help partners in their engagements with joint customers; and a points-based Channel Rewards program to recognize individuals for their contributions.

KNIME Announces Strategic Partnership with Snowflake

Open source data science company KNIME announced a strategic partnership with Snowflake. Users will be able to use the low/no-code KNIME Analytics Platform to perform analytics on data stored in Snowflake.

RapidMiner Releases New Version of Cloud Platform

RapidMiner announced the release of a new version of their data science platform. The latest version marks a move to the cloud as a multi-tenant, SaaS offering.

Teradata Vantage with Amazon SageMaker Launches

Enterprise data platform Teradata introduced Teradata Vantage, a multi-cloud analytics platform integrated with machine learning service Amazon SageMaker. The partnership will allow Teradata customers to access machine learning capabilities via Amazon and apply it to data and analytics hosted on Teradata.

Events

TIBCO Analytics Forum 2022 to Occur June 13-15

TIBCO Analytics Forum (TAF) returns June 13-15, 2022. The online-only event has a theme of “Analytics in Time and Space.” Featured speakers include Ben Shneiderman, computer science professor and founding director of the Human-Computer Interaction Laboratory at the University of Maryland; data visualization guru Nadieh Brehmer; David Baltar Boilève, data scientist at Hospital Universitario Lucus Augusti; Mark Lora, director of enterprise data systems, Taylor University; and Birchcliff Energy analytics engineer Monica Brookwell, among others. To register for the event, please visit TAF 2022

Posted on

June 3: From BI to AI (bodo.ai, Gigasheet, Incorta, One AI, Oracle, Rockset, Saturn Cloud)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Gigasheet Raises $7M Series A Round

Gigasheet, a no-code analytics platform, announced that they had secured $7M in Series A funding. Participants in the funding round included Accomplice, Argon, Founder Collective, and REV, along with individual investors. The funds will go towards filling out their product road map and expanding their future enterprise offering.

One AI Announces $8M Seed Round, Launches NLP-as-a-Service

One AI, a natural language processing provider, announced that they had raised $8M in seed funding from angel investors. Along with the funding, One AI emerged from stealth, launching their NLP-as-a-Service offering. Their Language Skills API includes a number of NLP models for specific business use cases such as conversation and article summarization, clustering and text analytics, and emotion and sentiment extraction, among others. Developers will be able to use these models to transform unstructured text into structured data.

Launches and Updates

Incorta Integrates Delta Sharing, Data Apps

Incorta, a realtime analytics platform, debuted new capabilities this week. Among the new features are a native Delta Sharing integration, allowing Incorta customers to securely share operational data more quickly. Incorta also launched several data apps that acquire operational data from source systems and prepare it for analysis, with already-built business schemas and dashboards for Oracle EBS, Oracle ERP and EPM Clouds, Netsuite, SAP, and others.

Rockset Reveals Oracle Integration

Analytics platform Rockset announced a new integration with Oracle this week, allowing developers to run search, aggregations, and joins on data from Oracle databases in real time. Rockset ingests change data capture streams from Oracle, enabling swift analytical queries.

Saturn Cloud and Bodo.ai Announce Partnership to Make Python Analytics More Performant

Data science and machine learning platform Saturn Cloud and parallel data compute platform bodo.ai have launched a partnership. Bodo.ai software running within Saturn Cloud resources will allow data scientists to scale up their model prototypes to “petabyte-scale parallel processing production” without requiring tuning or re-coding a model for scaling.

Posted on

May 27: From BI to AI (Alteryx, Anaconda, Google, DataRobot, Hugging Face, Informatica, MANTA, Microsoft, Oracle)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Data Lineage Platform MANTA Announces $35M Series B

MANTA, an automated data lineage platform, announced that it had raised $35M in Series B funding. Forestay Capital led the round; existing investors Bessemer Venture Partners, Credo Ventures, SAP.io, and Senovo, and new investor European Bank for Reconstruction and Development also participated.

Informatica World 2022

Informatica Announces Numerous Updates at Informatica World 2022

At Informatica World, Informatica debuted a number of improvements for data management, data analytics, and data governance. New to the fold is INFACore, a plugin for data science and development frameworks to provide data management in data science and data engineering development environments, simplifying the process of composing data pipelines and deploying them to data apps. Informatica also launched Informatica ModelServe, a service to permit users to more easily deploy machine learning models.

Informatica continued growing their portfolio of vertical-specific versions of their Intelligent Data Management Cloud. The IDMC for Healthcare and Life Sciences addresses the need for master data management that provides a “single source of truth” on patient and provider data while complying with HIPAA regulations and significant governance requirements, as well as data quality rules gathered into the Data Quality Accelerator for Crisis Response to cleanse, standardize, and validate healthcare data. IDMC for Healthcare and Life Sciences also supports connectivity to common healthcare software packages. The IDMC for Financial Services complies with financial industry regulations, supports financial industry data standards, supplies a set of financial-industry-specific data rules gathered into the Data Quality Accelerator for Financial Services to process financial data, and provides metadata scanners specialized for extracting metadata from financial data.

Finally, Informatica announced several major partnerships with large cloud vendors. With Google, they launched the Informatica Data Loader for Google BigQuery, a no-code SaaS service that Google Cloud customers can use to quickly ingest data into their Google BigQuery cloud data warehouse. With Azure, Informatica announced a SaaS version of Informatica Master Data Management on Azure, currently in private preview. And with Oracle, Informatica has integrated Informatica’s Intelligent Data Management Cloud with Oracle Autonomous Database, Oracle Exadata Database Service, Oracle Exadata Cloud@ Customer, and Oracle Object Storage. Oracle, in turn, has named Informatica as a preferred partner for enterprise cloud data integration and data governance for data warehouse and lakehouse solutions on Oracle Cloud Infrastructure.

Microsoft Build

Microsoft Integrates Power BI Into PowerPoint, Outlook, and the Office Hub; Launches Datamart Capabilities Within Power BI

At their Build conference this week, Microsoft unofficially welcomed Power BI to the Microsoft Office family as it announced significant integrations of Power BI into PowerPoint, Outlook, and the Office Hub at this week’s Microsoft Build conference. Power BI reports can now be embedded into Power Point and Outlook, bringing data interactivity capabilities into presentations and emails, and replacing out-of-date screenshots with live data visualizations that can be sliced, filtered, and drilled down into. Users will also be able to launch Power BI and find and consume related content directly from the Office Hub.

Microsoft also released a self-service “datamart” capability within Power BI. Business analysts will be able to use a no-code interface to build a data mart on top of any data warehouse or combination of data sources that can be centrally managed and governed, without needing to go through IT, saving time on both sides. The data mart automatically generates a dataset ready for report-building in Power BI, and users can find data marts easily in the Power BI Data Hub, Excel, and Teams.

Also at Microsoft Build, Microsoft Azure AI announced two updates to Azure Cognitive Services. Azure OpenAI Service allows customers to implement reasoning and comprehension capabilities for use cases such as code generation, writing assistance, and deconstructing unstructured data. Azure Cognitive Service for Language adds document and conversational summarization capabilities to help surface key information from unstructured data such as documents and contact center calls.

Azure Machine Learning revealed a number of updates. The Azure Machine Learning responsible AI dashboard, now in preview, unites a number of capabilities to assess machine learning models in one pane of glass. Azure Machine Learning managed endpoints allow developers and data scientists to more easily deploy large-scale models; these managed endpoints are now generally available. (Machine learning platform Hugging Face is among the vendors collaboratively announcing their own endpoints powered by Azure Machine Learning’s managed endpoints service.) In addition, new AutoML features include support for natural language processing and image tasks, as well as enhancements for product integration and machine learning ops.

Finally, Microsoft also launched the Microsoft Intelligent Data Platform, an integrated platform that brings together databases, analytics, and governance. Within the platform, customers have access to four different Microsoft databases, three different Microsoft analytics services, and Microsoft Purview for governance.

Additional Launches and Updates

Alteryx Debuts FIPS-Compatible Version of Alteryx Designer

On May 25, Alteryx announced Alteryx Designer-FIPS, a version of Alteryx Designer that follows the data security and computer system standards specified in the Federal Information Processing Standards. With this announcement, government agencies and other public sector organizations will be able to automate analytics while complying with FIPS.

Hiring

Anaconda Names Shahz Afzal as SVP of Marketing & Strategy

On May 25, Anaconda welcomed Shahz Afzal as the SVP of Marketing and Strategy, to oversee Anaconda customer and open-source community engagement. Afzal was most recently the Global Head of ISV (Independent Software Vendors) at AWS, in charge of go-to-market strategies, data providers, and consulting partners within the AWS Marketplace. Prior to that, Afzal was Vice President of Marketing for IBM’s Hybrid Cloud unit, and spent 15 years at Microsoft overseeing cloud transition initiatives. Anaconda also brought Python thought leaders Russell Keith-Magee and Antonio Cuni on board to support Anaconda as a force for data science in Python.

DataRobot Appoints Former Salesforce CFO Mark Hawkins as Chairman of the Board

On May 24, DataRobot announced that they had appointed Mark Hawkins as Chairman of the Board of Directors. Hawkins has served on the DataRobot Board of Directors since 2021. Previously, Hawkins was the President and CFO for Salesforce, which went public and increased its valuation by over $200M during his tenure. Hawkins brings to DataRobot 35 years of experience leading finance teams within global technology companies including Autodesk, Logitech, Dell, and Hewlett-Packard.

Posted on

May 20: From BI to AI (Alteryx, Apollo, Databricks, Franz, Ground Labs, Heartex, Imply, Komprise, Okera, Tableau)

Funding

Heartex Raises $25 Million Series A

Heartex, the company behind open source data labeling platform Label Studio, announced that it had raised $25M in Series A funding. Redpoint Ventures led the round, with participation from existing investors Bow Capital, Swift Ventures, Unusual Ventures, and angel investors. Funding will go towards R+D for Label Studio, focusing on bias detection and mitigation, labeling automation, and other analytics and data quality management capabilities.

Imply Announces $100M Series D Round at $1.1B Valuation

Imply, an analytics database company, raised a $100M Series D funding round this week. Thoma Bravo led the round, with participation from new investors OMERS Growth Equity and existing investors Andreessen Horowitz, Bessemer Venture Funds, and Khosla Ventures.

Launches and Updates

Alteryx Reveals New Cloud Capabilities at Inspire Conference

On May 18, at Alteryx Inspire, Alteryx revealed improvements to its analytics platform. Key new capabilities include text mining and computer vision additions to the Alteryx Intelligence Suite to analyze unstructured data, enhancements to predictive time series modeling in Alteryx Machine Learning, and the integration of Trifacta into Alteryx Designer Cloud, which adds SSH tunneling to enhance cloud security and governance capabilities.

Tableau Cloud Debuts At Annual Tableau Conference

On May 17, Tableau revealed Tableau Cloud, the next generation of its previous Tableau Online offering, at its annual Tableau conference. New features include Data Stories, automatically-generated natural language generated explanations of Tableau Dashboards to help users more thoroughly understand what they’re seeing; expanding the number of Accelerators (customizable dashboard templates) in its Tableau Exchange store; and further integrating Einstein Discovery and Tableau capabilities within Salesforce CRM Analytics with text clustering to extract keywords from large text fields, and bias detection for multi class models to pinpoint the biasing variables in a model to be removed without requiring model retraining.

Franz’s AllegroGraph 7.3 Improves GraphQL Capabilities

On May 17, Franz Inc, a graph database company, announced AllegroGraph 7.3. This latest version includes upgraded GraphQL query capabilities to work with distributed knowledge graphs and data fabrics. Enhancements to the GraphQL APIs will allow for more complex and performant queries to support data-driven apps.

Apollo GraphQL Releases the Supergraph

On May 18, Apollo GraphQL launched what they’re calling the “supergraph,” a layer to facilitate collaboration between backend data and services and the apps and devices on the front end, but with ambitions for supporting future business needs in an agile composable manner. As part of defining their supergraph stack, Apollo is also launching Apollo Router, which processes GraphQL queries and returns results back to the client significantly faster than its predecessor. Finally, Apollo is adding two features to Apollo Studio’s free tier – Schema Checks, which audits newly composed schemas to protect client apps’ processing, and Launches, which provides a window into the schema-checking and schema-launching processes in Studio.

Ground Labs Reveals Enterprise Recon 2.6

Data discovery company Ground Labs announced the general availability of Enterprise Recon 2.6, a tool that allows companies to find and remediate personally identifiable information (PII) and other sensitive information. New features include improvements to data access governance; new reporting features such as risk scoring and labeling; and the ability to scan for sensitive information on data sources like Google Cloud Storage, SAP HANA databases, Salesforce CRM, and Cloudera Distribution for Hadoop.

Komprise Debuts Smart Data Workflows

Komprise, an unstructured data discovery company, announced Komprise Smart Data Workflows, a process to find and discover relevant data across on-prem, cloud, and edge devices, and then direct said data to data lakes and AI and machine learning tools. Core improvements include expanding Deep Analytics Actions to include “copy and confine” actions from Deep Analytics queries, adding the ability to execute external functions via an API, and increasing global tagging and search capabilities in workflows.

Hiring

Databricks Welcomes Trâm Phi as Senior Vice President and General Counsel

On May 19, Databricks announced that they had appointed Trâm Phi as Senior Vice President and General Counsel. Prior to Databricks, Phi served as SVP, General Counsel at DocuSign, where she grew the legal function as DocuSign transitioned to a mature public company. Before that, Phi was the Chief Legal Officer and Chief of Staff at Imperva, and the Vice President, General Counsel at ArcSight, where she led both teams as the companies went public.

Events

Okera to Host AIRSIDE LIVE 2022 May 25-26

Okera will host AIRSIDE LIVE 2022 as both an in-person event in New York and a virtual event. The event will focus on four key pillars: data management, data security, data privacy and governance, and data as a product, to help companies safely and securely put their data to use. Featured speakers include Aaron Carreras and Nate Weisz from FINRA; Cheryl Flink from the Center for Creative Leadership,; Jeff Becraf, head of US Sales for Data and AI at Kindryl; Guy Adams, Chief Technology Officer at dataops.live, and Mike Meriton, co-founder and COO of the EDM Council. To attend the event, register on the Airside LIVE 2022 site.

Posted on

May 13: From BI to AI (AWS, Databricks, DataRobot, Dremio, Google, Hugging Face, Mindtech, Predibase, Privacera, Pyramid Analytics, Snowflake, Starburst, ThoughtSpot)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Hugging Face Raises $100M C Round

Machine learning platform Hugging Face announced May 9 that they had raised $100M in Series C funding. Lux Capital led the round, with participation from Addition, AIX Ventures, a_capital, Betaworks, Coatue, Sequoia, SV Angel, and individual angel investors. Hugging Face will use the funding on R+D, product development, and education.

Predibase Announces $16.25M in Seed and A Round Funding, Emerges From Stealth

Predibase, a low-code machine learning platform, came out of stealth this week and announced $16.25M in Series A and seed funding. Greylock led both rounds, with participation from The Factory and angel investors. The funding will go towards hiring additional engineering and ML talent, as well as the go-to-market strategy and bringing Predibase into general availability.

Pyramid Analytics Closes $120M E Round

Pyramid Analytics, a decision intelligence and augmented analytics platform, announced on May 9 that they had closed $120M in Series E financing. HIG Growth Partners led the round, with participation from existing investors Clal Insurance Enterprises Holdings, General Oriental Investments, and Kingfisher Capital, and new investors JVP, Maor Investments, Sequoia Capital, and Viola Growth. Pyramid will use the money on product development, continued R+D, expanding partnerships globally, and hiring to support all of these efforts.

Launches and Updates

Databricks Strengthens AWS Partnership, Announces PAYGO Lakehouse Offering

On May 10, Databricks announced a pay-as-you-go lakehouse offering on AWS, available now. Customers will be able to launch a lakehouse from the AWS Marketplace whether or not they’ve used Databricks before; they can set up a Databricks account from within AWS, and even consolidate their Databricks usage bills into their AWS billing.

Google Serves Up LaMDA 2 Demos in its AI Test Kitchen

At Google I/O, Google unveiled its new Language Model for Dialog Applications (LaMDA) 2 conversational AI model, along with AI Test Kitchen, an app to demonstrate use cases for LaMDA 2. LaMDA is a generative text model, aiming to produce relevant textual responses based on patterns it recognizes from linguistic input. While no date has been announced for general availability, Google plans to open up LaMDA access to small groups of people.

Mindtech Chameleon Now Generates Diverse “Actors” to Address Bias

Mindtech Global, a synthetic data creation platform, has announced updates to its Chameleon platform. Chameleon 22.1 lets users automatically generate millions of “actors” in virtual worlds, creating privacy-compliant synthetic visual data for training computer vision systems. To address known bias issues, the Chameleon actors now have a range of configuration options, including height, build, age, skin tone, and clothing and hairstyle options.

Privacera Announces Release of Platform 6.3 and PrivaceraCloud 4.3

Privacera, a data access governance company, announced the release of the latest version of Privacera Platform 6.3 and PrivaceraCloud 4.3. New features include extending Attribute Based Access Control across all supported data and analytical sources, supplementing existing role-based and tag-based access control mechanisms; along with enhanced support for Google BigQuery, Starburst Enterprise, Databricks, and Snowflake.

ThoughtSpot Expands the Modern Analytics Cloud to Help Companies Dominate the Decade of Data

At their Beyond 2022 event this week, ThoughtSpot announced numerous new capabilities for their cloud analytics platform. Key updates include integrations and connectors for Amazon Redshift Serverless, Snowflake Data Marketplace, Databricks Partner Connect, Dremio, and Starburst Galaxy; templates, code samples, and ThoughtSpot Blocks to accelerate the development process; and automation capabilities to trigger actions based on analytics.

Events

DataRobot AIX 22 Celebrates AI Innovation, June 7-8, 2022

On June 7 and 8, DataRobot will host DataRobot AIX 22, a free virtual event to explore innovation in AI, analytics, and data science. Featured speakers include DataRobot executives Dan Wright, Debanjan Saha, Nenshad Bardoliwalla, and Michael Schmidt. Register for the event at the DataRobot AIX 22 website.

Posted on

May 6: From BI to AI (Accern, Domino, dotData, Exasol, Galileo, Google Cloud, Mathworks, Salesforce, Starburst)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Accern Raises $20M Series B Round

On May 2, Accern, a natural language processing platform for unstructured data streams, announced a $20M Series B round of financing. Fusion Fund and Mighty Capital co-led the round, with additional participation from Gaingels, Shasta Ventures, Tribe Capital, and Viaduct Ventures. The funding will go towards expanding sales and marketing.

dotData Closes $31.6M B Round

dotData, a data science automation platform, announced that it had raised a $31.6M Series B funding round. Otsuka Corporation, Sumitomo Mitsui Banking Corporation, and Sumitomo Mitsui Trust Bank, Ltd all participated in this round. dotData plans to use the money for product and service development, as well as expanding on sales efforts and business partnerships.

Galileo Emerges From Stealth with $5.1M Seed Round

Data intelligence platform Galileo emerged from stealth earlier this week with a $5.1M seed round. The Factory led the funding round, along with participation from additional angel investors. The funds will be used to accelerate hiring and R+D. The Galileo platform addresses data errors throughout the machine learning model lifecycle, focusing on unstructured data. Galileo is currently in private beta.

Launches and Updates

Domino 5.2 Release Announced, Generally Available in June

On May 5 at the Rev 3 conference, Domino Data Lab announced Domino 5.2, which will become generally available in June 2022. Key new features include the IntelliSize capability within Domino’s Durable Workspace, which will recommend the optimal size for a model development environment; a new data prep and visualization environment based on Apache Superset; integration with Snowflake to train models in-database in Snowpark, then deploy them to the Data Cloud for in-database scoring; and realtime model monitoring in the Snowflake Data Cloud.

Exasol Reveals New Capabilities for Exasol SaaS

Analytics database company Exasol announced that its Exasol SaaS database now interfaces natively with Keboola, a cloud-based Data Stack as a Service platform. In addition to the Keboola collaboration, additional new capabilities of Exasol SaaS include enhanced machine learning capabilities within the database, support for data virtualization without needing to migrate data, and an Amazon SageMaker extension that uses SageMaker AutoPilot to develop machine learning projects based on data stored in Exasol.

Google Cloud Launches New Data Solutions for Manufacturers

This week, Google Cloud launched two data solutions specifically for manufacturers, Manufacturing Data Engine and Manufacturing Connect. Manufacturing Data Engine integrates a number of key Google Cloud products (such as BigQuery and Looker, among others) with a platform for ingesting, transforming, storing, and providing access to factory data. Manufacturing Connect is an edge solution that can connect to and stream machine and sensor data from manufacturing assets and systems directly to Google Cloud.

MathWorks Launches Startup-Targeted Suites

MathWorks is now offering tech startups access to suites that include MATLAB, Simulink, and over 100 industry-specific development tool stacks at a reduced price. Startups who meet MathWorks’ criteria as determined through an application process may be eligible for “startup-friendly” pricing based on company size, annual revenue, and how long the company has been in business. Access to the MATLAB suite for qualified startups now starts at $1500 for an individual seat, while pricing for the MATLAB and Simulink suite starts at $3600 for an individual seat. Electric vehicle and automated driving startups are highlighted as particular areas of interest on the MathWorks for Startups website.

Starburst Announces New Capabilities for Galaxy, Enterprise

At the Trino Summit this week, analytics engine company Starburst revealed new capabilities for its Starburst Galaxy and Starburst Enterprise products. Galaxy now includes a new lakehouse capability, Great Lakes Connector for object storage catalogs, which now supports the Iceberg and Delta Lake table formats in addition to Apache Hive, allowing access to all of these formats in one data lake. New features for Enterprise include built-in access control for consistent governance across Enterprise, as well as a REST API to make it easier to build and use data products.

Hiring

CIA CISO Joins Salesforce – Salesforce News

On May 5, Salesforce announced that former CIA Chief Information and Security Officer William MacMillan has joined Salesforce as the SVP of Security Product and Program Management, BISO, and Acquisition Integration. MacMillan joins Salesforce after a nearly 20-year career with the CIA culminating in the CISO role. Prior to that, MacMillan was a combat rescue and special operations pilot in the Air Force.

Posted on

April 29: From BI to AI (Akamai, Arize AI, Baseten, Credo AI, Enveil, Exafunction, expert.ai, HPE, Informatica, Linode, RelationalAI, Salesforce, Snowflake, Synthesis AI, Tableau)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Baseten Raises $12M A Round, Launches Product to Turn Machine Learning Models into Apps

Baseten, a platform that turns machine learning models into web applications, announced on April 26 that they had raised $20M in combined seed and Series A funding. The seed round of $8M was co-led by Greylock and South Park Commons Fund, while the A round was led by Greylock. Baseten also formally launched their product into public beta. Amalgam Insights’ Hyoun Park was quoted in Baseten’s press release announcing the funding and launch.

Enveil Secures $25M B Round

Enveil, a data privacy company, announced that it had closed $25M in Series B financing this week. USAA led the oversubscribed round, with participation from existing investors Bloomberg Beta, Capital One Ventures, C5 Capital, Cyber Mentor Fund, DataTribe, GC&H, In-Q-Tel, Mastercard, and 1843 Capital. Enveil plans to put the funds towards product development, expanding sales, and marketing.

Exafunction Raises $25 Million Series A Funding Led by Greenoaks | Business Wire

Exafunction, a deep learning infrastructure company, announced $25M in Series A financing. Greenoaks led the round, with Founders Fund participating. The funding will go towards R+D, customer training, and performance improvements, in particular around GPU virtualization.

RelationalAI Raises $75M B Round, Bringing Total Funding to $122M

On April 26, RelationalAI, a knowledge graph system builder, announced that they had closed $75M in a Series B funding round. Tiger Global led the round, with participation from existing investors Addition, Madrona Venture Group, and Menlo Ventures. The funding will go towards R+D and go-to-market activities. As part of the transaction, Bob Muglia, former CEO of Snowflake, has joined the RelationalAI board.

Synthesis AI Raises $17M in Series A Funding

On April 28, Synthesis AI, a synthetic data platform, announced that they had closed $17M in Series A financing. New investor 468 Capital led the round, with participation from additional new investors Sorenson Ventures and Strawberry Creek Ventures and existing investors Bee Partners, iRobot Boom Capital, Kubera Venture Capital, and PJC. Synthesis AI will use the funds for hiring, product development, and R+D at the intersection of AI and computer-generated imagery (CGI).

Launches and Updates

Akamai Debuts Linode Managed Database Service

On April 25, Akamai launched a managed database service powered by Linode, which Akamai acquired back in March. The service handles common maintenance and deployment tasks associated with database management, allowing for better performance and uptime. The launch reflects Akamai’s expansion into database management services to go along with its existing networking capabilities.

Arize AI Launches Bias Tracing to Address Algorithmic Bias

Arize AI, a machine learning observability platform, launched Arize Bias Tracing this week. The tool helps data science and machine learning teams monitor models for bias, discover what features and cohorts contribute to bias in a given model, and mitigate the impact of bias on said model.

Credo AI Reveals Responsible AI Governance Platform

Credo AI, a governance solution for AI, launched its Responsible AI platform this week. Key features in Responsible AI include a pipeline that ingests assessments from Credo AI Lens and translates them into risk scores across common AI risk areas; a repository for critical governance artifacts; the ability to assess AI risk and compliance of third-party AI and ML models via a dedicated portal.

Expert.ai Releases New Version of its Platform, Provides “Knowledge Model” AI Accelerators

Expert.ai, a natural language hybrid AI platform, announced the latest version of its platform on April 26. Notable features include a set of “Knowledge Models,”, which are pre-configured rules-based models that can extract entities, insights, and relationships from text within specific domains. Included domains with this release are finance, life sciences, environmental, social, and governance (ESG), personally identifiable information (PII), and behavioral and emotional traits. In addition, Azure has been added as a deployment environment, Boomi and Qlik can now be used to connect to third-party applications, and custom Python and Java can be used in the natural language workflow orchestrations.

Hewlett Packard Enterprise Reveals Machine Learning Development Environment

On April 27, Hewlett Packard Enterprise debuted their HPE Machine Learning Development System. The System consists of HPE’s machine learning platform, HPE Machine Learning Development Environment, along with hardware and software to optimize development, training, and deployment of machine learning models.

Tableau Expands Analytics Embedding Capabilities

At Salesforce TrailblazerDX ’22, Tableau announced additional capabilities around embedding analytics. These include Web Data Connector 3.0, which allows data developers to build connectors from their data to a web application; v3 of Tableau’s Embedding API, so Tableau analytics can be integrated into any application using web components; Embeddable Web Authoring, the ability to edit Tableau visualizations within any application or web portal; Connected Apps for Seamless Authentication, which permits Tableau to be integrated into developers’ applications with proper authentication; and Tableau Actions with Salesforce Flow, the ability to trigger workflows in Flow directly from a Tableau dashboard.

Partnerships

Informatica and Snowflake Integrate Governance Features

Informatica and Snowflake continue to grow their partnership. This week, Informatica announced an integration between Snowflake Data Cloud’s native governance features and Informatica’s Cloud Data Governance and Catalog. The integration will provide a dashboard allowing for easy monitoring of data governance, access controls, and end-to-end lineage.

Posted on

April 22: From BI to AI (Amazon, Arcion, Azure, Dassana, Databricks, Denodo, Grafana, IBM, Oracle, Privitar, SAS, StreamNative, TIBCO, Vianai)

If you would like your announcement to be included in Amalgam Insights’ weekly data and analytics roundups, please email lynne@amalgaminsights.com.

Funding

Dassana Surfaces From Stealth, Secures $5M in Seed Round

Cloud log lake company Dassana secured $5M in seed funding this week. Dell Technologies Capital led the round, with participation from additional angel investors. Dassana also announced that the Dassana Cloud Log Lake was now in public beta, distinguished by separating storage and compute to promote cost savings, and an optimized storage data structure to make queries to its cloud log lake highly performative.

Partnerships

Arcion Partners With Databricks

Last week, Arcion launched Arcion Cloud, its data replication platform; this week, Arcion announced that Arcion Cloud was now available via Databricks Partner Connect, enabling Databricks users to begin using Arcion to copy and move data from high-volume transactional databases without requiring coding skills.

Privitar Partners with Denodo

Data provisioning software provider Privitar announced a strategic partnership with Denodo, a data integration and management software provider. Combining Denodo’s data virtualization capabilities with Privitar’s focus on data provisioning and privacy will allow customers to provision their data in more easily reusable ways, while enforcing proper access and governance, and remaining compliant with applicable regulations.

Launches and Updates

Amazon Aurora Serverless v2 now Generally Available

At AWS Summit yesterday, Amazon announced the release of Amazon Aurora Serverless v2. Aurora Serverless is a database option for applications with unpredictable traffic that automatically scales capacity up and down as needed. Improvements to v2 include significantly quicker scaling speeds (fractions of a second as opposed to taking several seconds and up to nearly a minute in some cases with v1), and more granular scaling capabilities (v1 could only increase capacity by doubling).

Azure Managed Grafana Now In Preview

On April 18, Microsoft announced that Azure Managed Grafana was now available in preview, allowing Azure customers to integrate Grafana dashboards into their Azure environment, facilitating access to Azure services and data sources from within said dashboards. This includes the ability to securely share Grafana dashboards with Azure Active Directory.

Databricks Announces Media and Entertainment Lakehouse Offering

In the continuing rollout of vertical-specific lakehouse offerings, this week, Databricks debuted its Media and Entertainment Lakehouse. Notable features include accelerators for core industry use cases such as AI-driven recommendation engines, advertising optimization, and customer lifetime value and churn, among others.

StreamNative Launches StreamNative Cloud for Kafka

On April 21, StreamNative, a messaging and event streaming platform, announced StreamNative Cloud for Kafka. The new product addresses a couple of common issues Kafka users encounter with particularly high volumes of streaming data, while allowing users to continue ingesting streaming data formatted in the Kafka protocol but processing it under the hood using Apache Pulsar. StreamNative, founded in 2019, raised a $23M A round last fall.

TIBCO WebFOCUS 9.0.0 Makes its Debut

On April 19, TIBCO announced the release of TIBCO WebFOCUS 9.0.0. Key additions include a Container Edition of WebFOCUS, which permits Kubernetes deployments from within WebFOCUS; the Hub, a directory that makes it easier for users to access content and data from any device within WebFOCUS; and Designer updates that streamline the ability to create, manage, and stage datasets when creating content.

VIANAI Systems Introduces Vian H+AI

On April 19, Vianai Systems, an AI platform company headed by former SAP CTO Vishal Sikka, launched the Vian H+AI platform. The initial rollout includes Vian MLOps, to manage, optimize, deploy, and govern machine learning models. Key capabilities include model risk monitoring, software-based performance optimization, and quick model operationalization.

Appointments

SAS Data Ethics Director Reggie Townsend Named to National AI Advisory Committee

Reggie Townsend, director of the Data Ethics Practice for SAS, has been named to the National Artificial Intelligence Advisory Committee (NAIAC). The NAIAC advises the President on AI-related issues; members of the committee serve three-year terms. Townsend is also on the board of EqualAI, a nonprofit that works to reduce unconscious bias in AI development and usage.

Events

IBM’s Annual Think Conference to Expand Globally

IBM Think 2022 will kick off its flagship event in person in Boston on May 10, to be followed by a global “Think on Tour” series of invite-only gatherings. IBM Chairman and Chief Executive Officer Arvind Krishna will provide the opening keynote in Boston. Key conference themes will include how AI and automation provide opportunities to rethink business operations, among other topics. IBM will also present Think Broadcast on May 10 and 11, a live anchored program for those who cannot attend Think 2022 in person.

One Last Note

61% of People Believe Bots Will Succeed Where Humans Have Failed with Corporate Sustainability

According to a new Oracle-Harvard study. Whether this speaks to the survey respondents’ faith in AI or doubt in humanity …