Roles and Responsibilities:
As a, Associate Manager - Senior Data scientist you will solve some of the most impactful business problems for our clients using a variety of AI and ML technologies. You will collaborate with business partners and domain experts to design and develop innovative solutions on the data to achieve predefined outcomes.
Qualification:
Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
5+ years of experience in data science, building hands-on ML models
Experience with LMs (Llama (1/2/3), T5, Falcon, Langchain or framework similar like Langchain)
Candidate must be aware of entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and finetuning
Candidate must be comfortable interpreting research papers and architecture diagrams of Language Models
Candidate must be comfortable with LORA, RAG, Instruct fine-tuning, Quantization, etc.
Experience leading the end-to-end design, development, and deployment of predictive modeling solutions.
Excellent programming skills in Python. Strong working knowledge of Pythons numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc.
Advanced SQL skills with SQL Server and Spark experience.
Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised) and deep learning algorithms and Artificial Neural Networks
Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment
Experience with data visualization tools PowerBI, Tableau, R Shiny, etc. preferred
Experience with cloud platforms such as Azure, AWS is preferred but not required
Keyskills: Generative Ai Data Science Predictive Modeling python Large Language Model Natural Language Processing RAG Statistics Machine Learning sql