Data Migrator

Organizations are modernizing their on-premises data architectures that were implemented using technologies such as Hadoop or Spark to more innovative solutions in the cloud. Doing so, however, raises challenges due to the size of the data, the amount of data under active change, and the potential for disrupting these environments.

Data Migrator explained

Move your data to the cloud destination of choice where you can start running AI and advanced analytics faster with no downtime.

Data Migrator is a fully automated solution that moves on-premise HDFS data, Hive metadata, local filesystem, or cloud data sources to any cloud or on-premises environment, even while those datasets are under active change. Data Migrator requires zero changes to applications or business operations. Moving data of any scale can begin immediately and be performed without production system downtime or business disruption, and with zero risk of data loss.

Read datasheet

Benefits of Cirata Data Migrator

Avoid cloud lock-in

Cloud storage platform agnostic
Integrates with all major cloud storage services and analytic technologies
Choose one or multiple cloud storage destinations

Business continuity

Requires no production system downtime
Requires zero changes to source applications
Provides immediate availability of migrated data

Lower costs

Automates migration to minimize the need for IT resources
Requires zero custom-code development or maintenance
Provides faster time-to-value and adoption of AI and machine learning

Cirata Data Migrator automates the movement of data to the cloud

The following capabilities enable zero business disruption, reduced risk, and best time-to-value.

Quick deployment and operation

Data Migrator is installed on an edge node of your Hadoop cluster. Deployment can be performed in minutes without impacting current operations, so users can begin moving data immediately.

Complete and continuous migration

Existing datasets can be moved with a single pass through the source storage system, eliminating the CPU cycles and overhead associated with multiple scans, while also supporting continuous migration of any ongoing changes from source to target with zero disruption to current production systems.

Support for multiple sources and targets

Data Migrator supports HDFS distributions v2.6 and higher as source systems, as well as leading cloud service providers and select independent software vendors, such as Databricks and Snowflake, as target systems. See Data Migrator documentation for further details.

Transfer Hadoop data and Hive metadata

Data Migrator supports migration of HDFS data and Hive metadata to any public cloud and on-premises environments.

Data transfer at any scale

Datasets of any size — from terabytes to multiple petabytes — can be moved without affecting production environments. Horizontal scaling capabilities allow users to scale their migration capacity by configuring transfer agents to maximize the productivity of available bandwidth.

Easy management

Cirata browser-based user interface (UI) lets users manage the entire data and metadata migration from a single management console.

Programmatic interface

Migrations can also be managed through a comprehensive and intuitive command-line interface or by using the self-documenting representational state transfer API to integrate the solution with other programs as needed.

Flexible configurations and precise control

Organizations can configure migration jobs to meet their specific needs, such as defining sources, targets, and which data to migrate. There are also advanced capabilities, such as migration prioritization, path mapping, and network bandwidth-management controls.

Transfer verification

Data Migrator contains a data transfer verification function that scans both source and target environments to ensure data fidelity and validate the success of all data transfers. Results and reports are delivered through the UI or by email.

Powerful metrics and real-time monitoring

Users are updated on migration jobs, from health and status metrics providing estimates for migration completion to email notifications and real-time insights regarding usage enabling hands-off operations.

“Data Migrator handles everything in the background and doesn't require expertise from the customer. It's as close to a silver bullet as you can find for large scale Hadoop migration.”

— Merv Adrian, Former Vice President of Data and Analytics, Gartner Research

Powerful cloud-connected data use cases

Hadoop data migration

Rapidly make the shift away from legacy data technologies and underutilized datasets to more advanced and capable data platforms in the cloud, including AI, ML, and advanced analytics.

Learn more

Disaster recovery

Ensure critical data assets are readily available by seamlessly maintaining a replica of data lake environments and actively-used data in secondary locations (either cloud or on-premises).

Learn more

Hybrid cloud

Easily implement flexible data architectures that maintain data in hybrid environments including on-premises, cloud, multi-cloud, and intercloud deployments.

Learn more

Cirata Data Migrator

Easily move your data and metadata to any cloud with no downtime and no business disruption.

Data Migrator explained

Benefits of Cirata Data Migrator

Cirata Data Migrator automates the movement of data to the cloud

Powerful cloud-connected data use cases

Explore our in-depth demo videos using Data Migrator.

Featured resources