DataCleaner is an Open Source application for profiling, validating and comparing data. These activities help you administer and monitor your data quality in order to ensure that your data is useful and applicable to your business situation.
DataCleaner is the free alternative to software for master data management (MDM) methodologies, data warehousing (DW) projects, statistical research, preparation for extract-transform-load (ETL) activities and more.
New: This wiki page is no longer to be considered the main homepage of DataCleaner! This serves as a community place for documentation collaboration and ad hoc writing of snippets and small pieces of information.
Please be sure to visit the new official DataCleaner website at http://datacleaner.org
- User guide
- Design documentation / developers guide
- Database configuration examples: MySQL | Postgresql | | Oracle | Microsoft SQL Server | more examples at the DataCleaner features page.