Free Online Certification Courses – Learn Today. Lead Tomorrow. › Forums › Apache Hadoop › what will be the validation in sqoop command
- This topic has 1 reply, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.
-
AuthorPosts
-
-
September 20, 2018 at 12:41 pm #4897DataFlair TeamSpectator
what will be the validation in sqoop command
-
September 20, 2018 at 12:41 pm #4899DataFlair TeamSpectator
Sqoop Validation
A simple definition of Sqoop validation validates the data copied. By comparing the row counts from the source as well as the target post copy, either import or Export .Especially, to compare the row counts between source as well as the target just after data imported into HDFS, we use this option.
All the rows are deleted or added, while during the imports, though, Sqoop tracks this change. Moreover, it updates the log file.
Interfaces of Sqoop Validation
3 interfaces of Sqoop Validation are:a. ValidationThreshold
In order to determine whether the error margin between the source and target are acceptable(Absolute, Percentage Tolerant and many more) we use the ValidationThreshold. Although, its default implementation is AbsoluteValidationThreshold.Basically, that ensures that the row counts from source as well as targets are the same.
b. ValidationFailureHandler
Also, it has once interface with ValidationFailureHandler, that is responsible for handling failures here. Such as log an error/warning, abort and many more. Although default implementation is LogOnFailureHandler. Here that logs a warning message to the configured logger.c. Validator
Generally, Validator drives the validation logic, by delegating the decision to ValidationThreshold.However to learn about Sqoop validation in detail, follow the link: Sqoop Validation – Interfaces & Limitations of Sqoop Validate
-
-
AuthorPosts
- You must be logged in to reply to this topic.