Job Description
Job Summary-
Data Scientist with good hands-on experience of 3+ years in developing state of the art and scalable Machine Learning models and their operationalization, leveraging off-the-shelf workbench production. Job Responsibilities-1. Hands on experience in Python data-science and math packages such as NumPy, Pandas, Sklearn, Seaborn, PyCaret, Matplotlib
2. Proficiency in Python and common Machine Learning frameworks (TensorFlow, NLTK, Stanford NLP, PyTorch, Ling Pipe, Caffe, Keras, SparkML and OpenAI etc.)
3. Experience of working in large teams and using collaboration tools like GIT, Jira and Confluence
4. Good understanding of any of the cloud platform - AWS, Azure or GCP
5. Understanding of Commercial Pharma landscape and Patient Data / Analytics would be a huge plus
6. Should have an attitude of willingness to learn, accepting the challenging environment and confidence in delivering the results within timelines. Should be inclined towards self motivation and self-driven to find solutions for problems.
7. Should be able to mentor and guide mid to large sized teams under him/herJob -
1. Strong experience on Spark with Scala/Python/Java
2. Strong proficiency in building/training/evaluating state of the art machine learning models and its deployment
3. Proficiency in Statistical and Probabilistic methods such as SVM, Decision-Trees, Bagging and Boosting Techniques, Clustering
4. Proficiency in Core NLP techniques like Text Classification, Named Entity Recognition (NER), Topic Modeling, Sentiment Analysis, etc. Understanding of Generative AI / Large Language Models / Transformers would be a plus
Job Classification
Industry: Analytics / KPO / Research
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Science & Machine Learning - Other
Employement Type: Full time
Contact Details:
Company: Axtria
Location(s): Noida, Gurugram
Keyskills:
scala
java
spark
machine learning algorithms
python
confluence
scikit-learn
nltk
training
numpy
tensorflow
git
seaborn
gcp
pytorch
keras
spark mllib
jira
sentiment analysis
lingpipe
caffe
microsoft azure
pandas
matplotlib
aws
statistics