Difference between revisions of "Data Validation"
(Created page with "[[{{PAGENAME}}]] is the process of ensuring that the data has undergone data cleaning to assure the data quality, and making sure that the data is correct and useful. The proc...") |
|||
(6 intermediate revisions by the same user not shown) | |||
Line 3: | Line 3: | ||
[[{{PAGENAME}}]] is intended to provide certain well-guarantees for fitness and consistency of the data in a certain system. The rules within can be defined using several methods and may be deployed in various contexts. The major implementation of [[{{PAGENAME}}]] are through [[Data Integrity]] rules and Procedure-based Business rules. | [[{{PAGENAME}}]] is intended to provide certain well-guarantees for fitness and consistency of the data in a certain system. The rules within can be defined using several methods and may be deployed in various contexts. The major implementation of [[{{PAGENAME}}]] are through [[Data Integrity]] rules and Procedure-based Business rules. | ||
[[{{PAGENAME}}]] do not necessarily guarantee its accuracy and it is possible for data entry errors to be accepted as valid. | [[{{PAGENAME}}]] do not necessarily guarantee its accuracy and it is possible for data entry errors to be accepted as valid. [[{{PAGENAME}}]] cannot be confused with [[Data Verification]]. [[{{PAGENAME}}]] is a required step to ensure [[Data Integrity]]. | ||
==Data Validation Types== | |||
There are several types depending on the different kinds of validation according to their scope, complexity, and purpose. | |||
===Data-type Check=== | |||
Data type validation typically done on simple data fields. The simplest approach is to make sure every characters from a user input are consistent with the expected characters of specific known data types as intended. For example, an integer field can be set to only accept characters from 0 to 9. | |||
===Range and constraint Check=== | |||
Range and constraint validation examine user input for consistency with maximum/minimum range or consistency with a test for a sequence of characters. For example, a password is required to meet a minimum length and must contain characters from multiple groups (alphabets, numerical, special characters). | |||
===Code and cross-reference Check=== | |||
Code and cross-reference validation verify the consistency of the data with one or more rules, requirements, or collections relevant to a certain context or set of assumptions. For example, a user-provided country code might be required to identify a geopolitical region. | |||
===Structured Check=== | |||
Structured validation combines several others validation, usually it is on a more complex system. | |||
===Consistency Check=== | |||
Consistency validation ensure the logicality of the data. For example, expiry date of a document cannot precedes its starting date. | |||
==Data Validation Methods== | |||
*Allowed Characters Check | |||
*Batch Totals | |||
*Cardinality Check | |||
*Check Digits | |||
*Consistency Check | |||
*Cross-system Consistency Check | |||
*Data Type Check | |||
*File Existence Check | |||
*Format Check | |||
*Presence Check | |||
*Range Check | |||
*Referential Integrity Check | |||
*Spelling and Grammar Check | |||
*Uniqueness Check | |||
*Table-look Check | |||
==Data Validation Ensuring with PKC== | |||
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Quisque ex odio, venenatis eu ex ac, finibus ultrices diam. Donec eu purus lacinia sapien dapibus dictum sed eget diam. Sed vel neque egestas, ultrices est ut, scelerisque dolor. Donec vel erat id enim porta gravida nec in lectus. Morbi hendrerit vel velit eu maximus. In mattis dolor velit, ac suscipit metus vestibulum condimentum. Integer mattis consequat libero, et placerat erat rutrum in. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia curae; Donec maximus, justo in bibendum accumsan, lectus risus imperdiet sapien, eget mollis ipsum eros eu sem. Donec id ultrices nisl. Vivamus vulputate arcu a ipsum maximus tempor et nec ligula. Nam elit ex, pellentesque vitae orci vel, pharetra efficitur neque. Praesent dapibus vulputate nunc, ut vulputate ipsum finibus id. | |||
=References= | |||
<references/> | |||
=Related Pages= | |||
[[Category: Data Science]] |
Latest revision as of 18:34, 12 April 2022
Data Validation is the process of ensuring that the data has undergone data cleaning to assure the data quality, and making sure that the data is correct and useful. The processes within Data Validation are validation rules, validation constraints, and check routines. These processes check the data for its correctness, meaningfulness, and secureness before it is inputted to the system. Data Validation may be implemented through automation (data dictionary), or by the inclusion of application program with validation logic.
Data Validation is intended to provide certain well-guarantees for fitness and consistency of the data in a certain system. The rules within can be defined using several methods and may be deployed in various contexts. The major implementation of Data Validation are through Data Integrity rules and Procedure-based Business rules.
Data Validation do not necessarily guarantee its accuracy and it is possible for data entry errors to be accepted as valid. Data Validation cannot be confused with Data Verification. Data Validation is a required step to ensure Data Integrity.
Data Validation Types
There are several types depending on the different kinds of validation according to their scope, complexity, and purpose.
Data-type Check
Data type validation typically done on simple data fields. The simplest approach is to make sure every characters from a user input are consistent with the expected characters of specific known data types as intended. For example, an integer field can be set to only accept characters from 0 to 9.
Range and constraint Check
Range and constraint validation examine user input for consistency with maximum/minimum range or consistency with a test for a sequence of characters. For example, a password is required to meet a minimum length and must contain characters from multiple groups (alphabets, numerical, special characters).
Code and cross-reference Check
Code and cross-reference validation verify the consistency of the data with one or more rules, requirements, or collections relevant to a certain context or set of assumptions. For example, a user-provided country code might be required to identify a geopolitical region.
Structured Check
Structured validation combines several others validation, usually it is on a more complex system.
Consistency Check
Consistency validation ensure the logicality of the data. For example, expiry date of a document cannot precedes its starting date.
Data Validation Methods
- Allowed Characters Check
- Batch Totals
- Cardinality Check
- Check Digits
- Consistency Check
- Cross-system Consistency Check
- Data Type Check
- File Existence Check
- Format Check
- Presence Check
- Range Check
- Referential Integrity Check
- Spelling and Grammar Check
- Uniqueness Check
- Table-look Check
Data Validation Ensuring with PKC
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Quisque ex odio, venenatis eu ex ac, finibus ultrices diam. Donec eu purus lacinia sapien dapibus dictum sed eget diam. Sed vel neque egestas, ultrices est ut, scelerisque dolor. Donec vel erat id enim porta gravida nec in lectus. Morbi hendrerit vel velit eu maximus. In mattis dolor velit, ac suscipit metus vestibulum condimentum. Integer mattis consequat libero, et placerat erat rutrum in. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia curae; Donec maximus, justo in bibendum accumsan, lectus risus imperdiet sapien, eget mollis ipsum eros eu sem. Donec id ultrices nisl. Vivamus vulputate arcu a ipsum maximus tempor et nec ligula. Nam elit ex, pellentesque vitae orci vel, pharetra efficitur neque. Praesent dapibus vulputate nunc, ut vulputate ipsum finibus id.