Move your data to the cloud destination of choice where you can start running AI and advanced analytics faster with no downtime.
Data Migrator is a fully automated solution that moves on-premise HDFS data, Hive metadata, local filesystem, or cloud data sources to any cloud or on-premises environment, even while those datasets are under active change. Data Migrator requires zero changes to applications or business operations. Moving data of any scale can begin immediately and be performed without production system downtime or business disruption, and with zero risk of data loss.
Data Migrator is installed on an edge node of your Hadoop cluster. Deployment can be performed in minutes without impacting current operations, so users can begin moving data immediately.
Existing datasets can be moved with a single pass through the source storage system, eliminating the CPU cycles and overhead associated with multiple scans, while also supporting continuous migration of any ongoing changes from source to target with zero disruption to current production systems.
Data Migrator supports HDFS distributions v2.6 and higher as source systems, as well as leading cloud service providers and select independent software vendors, such as Databricks and Snowflake, as target systems. See Data Migrator documentation for further details.
Data Migrator supports migration of HDFS data and Hive metadata to any public cloud and on-premises environments.
Datasets of any size — from terabytes to multiple petabytes — can be moved without affecting production environments. Horizontal scaling capabilities allow users to scale their migration capacity by configuring transfer agents to maximize the productivity of available bandwidth.
Cirata browser-based user interface (UI) lets users manage the entire data and metadata migration from a single management console.
Migrations can also be managed through a comprehensive and intuitive command-line interface or by using the self-documenting representational state transfer API to integrate the solution with other programs as needed.
Organizations can configure migration jobs to meet their specific needs, such as defining sources, targets, and which data to migrate. There are also advanced capabilities, such as migration prioritization, path mapping, and network bandwidth-management controls.
Data Migrator contains a data transfer verification function that scans both source and target environments to ensure data fidelity and validate the success of all data transfers. Results and reports are delivered through the UI or by email.
Users are updated on migration jobs, from health and status metrics providing estimates for migration completion to email notifications and real-time insights regarding usage enabling hands-off operations.