Open Studio for Data Quality | Open Studio for Data Integration | Data Integration Entry-Level | Data Management Platform | |
---|---|---|---|---|
License | Free Open Source | Free Open Source | User-based subscription | User-based subscription |
Free open source Apache license | ||||
Subscription license with warranty and indemnification | ||||
Available as downloadable software | ||||
Available as cloud service and downloadable software | ||||
2 free Data Preparation and 2 free Data Stewardship licenses with any Talend subscription | ||||
Design and Productivity Tools | ||||
Graphical design environment | ||||
Team collaboration with shared repository | ||||
Continuous integration / Continous delivery | ||||
Visual mapping for complex JSON, XML and EDI | ||||
Audit, job compare, impact analysis, testing, debugging, and tuning | ||||
Metadata bridge for metadata import/export, and centralized metadata management | ||||
Distant run and parallelization | ||||
Dynamic schema, re-usable joblets, and reference projects | ||||
Repository manager | ||||
ETL and ELT support | ||||
Wizards and interactive data viewer | ||||
Versioning | ||||
Export and execute standalone jobs in runtime environments | ||||
Change data capture (CDC) | ||||
Automatic documentation | ||||
Customizable assessment | ||||
Pattern library | ||||
Cloud Pipeline Designer* | ||||
Components | ||||
File management: open, move, compress, decompress without scripting | ||||
Control and orchestrate data flows and data integrations with master jobs | ||||
Map, aggregate, sort, enrich, and merge data | ||||
Connectors | ||||
Cloud: AWS, Microsoft Azure, Google Cloud Platform, and more | ||||
RDBMS: Oracle, Teradata, Microsoft SQL server, and more | ||||
SaaS: Marketo, Salesforce, NetSuite, and more | ||||
Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more | ||||
Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more | ||||
Cleansing, masking and error resolution | ||||
Optional 3rd-party address validation services | ||||
Management and Monitoring | ||||
High availability, load balancing, failover for Jobs | ||||
Deployment manager and team collaboration | ||||
Manage users, groups, roles, projects, and licenses | ||||
Manage execution engines | ||||
Execution plan, time, and event-based scheduler for jobs | ||||
Check points, error recovery | ||||
Context management (dev, QA, prod) | ||||
Activity monitoring | ||||
Log collection and display | ||||
Engine clusters for Jobs* | ||||
Static IP addresses* | ||||
Job execution log history (2 months for Entry products, 3 months for Platforms)* | ||||
Environments (2 for Entry products, unlimited for Platforms)* | ||||
Single Sign-On (SSO) integration with several SSO providers | ||||
Cloud Security Information and Event Management (SIEM), Intrusion Detection System (IDS), Intrusion Prevention System (IPS) and Web Application Firewall (WAF)* | ||||
Data Quality and Governance | ||||
Data profiling and analytics with graphical charts and drilldown data | ||||
Data privacy with masking and encryption | ||||
Automated data standardization, cleansing and rules enforcement | ||||
Data quality portal with monitoring, reporting, and dashboards | ||||
Semantic discovery with automatic detection of patterns | ||||
Comprehensive survivorship | ||||
Data sampling | ||||
Enrichment, harmonization, fuzzy matching, and de-duplication | ||||
Advanced Data Profiling | ||||
Fraud pattern detection using Benford Law | ||||
Advanced statistics with indicator thresholds | ||||
Column set analysis | ||||
Advanced matching analysis | ||||
Time column correlation analysis | ||||
Pipeline Designer* | ||||
Design pipelines in the cloud and run on-premises or in the cloud | ||||
Read/Write support for Snowflake; Amazon Redshift, S3; Azure SQL Database, SQL Data Warehouse, Blob Storage, Data Lake Store Gen2; Amazon RDS (Oracle, SQL Server, MySQL, PostgreSQL, Aurora); and on-premise through JDBC (Oracle, SQL Server, MySQL, MariaDB, PostgreSQL) | ||||
Connectors for SaaS: Saleforce; Streaming: Apache Kakfa, Amazon Kinesis (source only), Azure Event Hubs; NoSQL: Elasticsearch | ||||
Native cloud data warehouse connectors: Snowflake and Amazon Redshift bulk loaders (destination only) | ||||
Lightweight data transformations including filter, flatten/normalize, aggregate, replicate, look up, join, and time windowing | ||||
Live preview of sample data, and pipeline sharing | ||||
Design batch and streaming pipelines in the same interface, using the same connectors | ||||
Schema on-read support | ||||
Easily embed Python code | ||||
Supports data formats including: AVRO, JSON, Parquet, and CSV | ||||
Stores data in shared, common data set repository across all Talend products | ||||
Manage users and licenses, schedule pipelines and monitor status (TMC) | ||||
Data Preparation and Stewardship | ||||
Import, export, and combine data from any database, Excel or CSV file | ||||
Import, export, and combine CSV, Parquet and AVRO files** | ||||
Export to Tableau | ||||
Self-service on-demand access to sanctioned datasets | ||||
Share data preparations and datasets | ||||
Operationalize preparations into any data or big data integration flow | ||||
Run preparations on Apache Beam** | ||||
Auto-discovery, profiling, smart suggestions, and data visualization | ||||
Customization of semantic type for auto-profiling and standardization | ||||
Smart and selective sampling and full-runs | ||||
Data tracking and masking with role-based security | ||||
Cleansing and enrichment functions | ||||
Data Stewardship App for data curation and certification | ||||
Define data models, data semantics and profile data accordingly. Define and apply rules (survivorship, mass updates) | ||||
Merge and match data, resolve data errors, and arbitrate on data (classification and certification) | ||||
Orchestrate and collaborate on activities in campaigns | ||||
Define user roles, workflows and priorities, assign and delegate tasks, tag and comment | ||||
Embed governance and stewardship in data integration flows and manage rejects | ||||
Embed human certification and error resolution into MDM processes | ||||
Take matching decisions that cannot be processed automatically. | ||||
De-duplicate data at scale with machine learning | ||||
Audit and track data error resolution actions. Monitor progress of campaigns. Undo/redo based on business need | ||||
Support | Self-service | Self-service | Web, email | Web, email, phone |
Product Details | Product Details | Product Details | Product Details |