For a leading online business directory, we have improved the quality of datasets and enrich them for her Search Engine.
The assignment:
• Analyze the Datasets of tens of millions of data points to correct issues, errors, black holes…
• Data quality audit to identify root causes
• Find how to improve radically the datasets
• Data cleansing to improve the data quality
• Data retrieving & collection: Crawling, scrapping, structuring and analysis to increase the data quantity
• Algorithms and Machine Learning for data curation to improve the data quantity AND quality
Key Points:
Data management for tens of millions of Data
Creating & Combining techniques to clean data and increase the datasets
Improve the Marking-Points and Classification Trees for better search results