At Shaip, we're pushing the boundaries of AI innovation through world-class data solutions. We're seeking a highly qualified Computational Linguist (Ph.D. level) with 2-3 years of industry experience to lead transformative projects in ASR model development and LLM fine-tuning.
What Youll Do:
Lead dataset creation for cutting-edge ASR and LLM systems, ensuring
data diversity and alignment with AI model objectives.
Own project lifecycles end-to-end from planning and execution
to delivery working cross-functionally with internal teams and
external clients.
Interface with clients to understand requirements, provide status
updates, and exceed expectations with high-quality deliverables.
Uphold quality standards through robust QA processes, ensuring
data accuracy, consistency, and completeness.
Mentor and manage teams of linguists and annotators, driving
collaboration and innovation.
Provide technical expertise in preprocessing, annotation, and model
evaluation to steer AI model optimization.
What You Bring:
Ph.D. in Computational Linguistics or Linguistics
23 years of experience managing data-driven projects for ASR or LLMs
Deep technical fluency in model architectures, data curation, and
QA methodologies
Strong project and team management skills with proven delivery
track record
Excellent client communication skills and a passion for building
diverse, representative datasets
Bonus Points:
Published research in AI or computational linguistics
Familiarity with annotation tools and transcription software
Commitment to ethical AI practices, including bias mitigation
Keyskills: Linguistics Client Communication Aiml Qa Process AI model optimization