tgoop.com/itfreelancers/5143
Last Update:
‼️‼️🆕 #lookfor #Data Scientist #outstaff
Привіт всім,
Шукаємо спеціаліста з наступним досвідом:
*Data Scientist (Project-Based)*
Project Duration: 2-3 weeks
Project Description: We are seeking a Data Scientist for a short-term project aimed at developing a microservice for a data analytics firm. The project involves creating a solution to match column names across different data tables, essential for data consolidation and migration tasks. The microservice will address non-matching column names by using word embedding techniques to transform them into vector representations, calculating similarities, and applying the Hungarian algorithm for optimal pairing.
For example, if one table has a column named “CustomerID” and another table has a similar column named “Cust_ID,” our microservice will use NLP techniques to identify these columns as a match.
Responsibilities:
• Implement word embedding conversion using pre-trained models such as word2vec, GloVe, and BERT.
• Develop algorithms for data collection, correlation, and standardization.
• Analyze datasets to extract actionable insights and detect patterns.
• Compute similarity matrices and construct distance matrices for column name matching.
• Apply thresholds to detect mismatches and identify unique column names.
• Collaborate with the team to ensure the prototype meets operational requirements.
Required Skills:
• Expertise in pattern recognition and data analysis.
• Advanced knowledge of word embedding techniques and similarity matrix construction.
• Proficiency in Python and Linux-based containerized environments.
• Experience with machine learning, NLP, and algorithmic matching.
• Familiarity with the Hungarian algorithm and mismatch detection.
• Ability to work with complex datasets and perform statistical analysis.
• Experience with Docker and API development.
📩 Please send CV, hourly rate, location and availability to @k_tiupa
✖️No RF and RB ‼️‼️
BY IT freelance and remote
Share with your friend now:
tgoop.com/itfreelancers/5143