SONY Work From Home Internships 2025

SONY is hiring interns for work-from-home roles based in Bengaluru, Karnataka, India. The complete details of the SONY Work From Home Internships 2025 are as follows.

Company Hiring: SONY
Required Education: Bachelor’s or Master’s Degree
Job Roles: 1) Speech Recognition Intern, 2) Multimodal AI Intern
Job Type: Work From Home – Internship

Job Role 1: Speech Recognition Intern

Location: Bengaluru, Karnataka, India

Required Qualifications:

Currently pursuing or has completed a Master’s (Research) or Ph.D. in deep learning/machine learning, with hands-on experience with Transformer models applied to audio/speech.

Must Have Skills:

  • Strong programming skills in Python, and familiarity with PyTorch or TensorFlow.
  • Experience with speech processing libraries (e.g., Torchaudio, ESPnet, Hugging Face Transformers).
  • Prior experience with ASR models like Wav2Vec2, Whisper, or RNN-T is a plus.
  • Ability to read and implement academic papers.
  • Strong foundation in machine learning and signal processing.

Good to have skills:

  • Familiarity with prompt tuning, contrastive learning, or multi-modal architectures.
  • Experience with evaluating hallucinations or generating synthetic speech/audio perturbations.

Key Responsibilities:

Sony Research India is seeking a dynamic and motivated Speech Recognition Intern to join our innovative research team. As an intern, you will work on real-world problems in automatic speech recognition (ASR), focusing on improving noise robustness and reducing hallucinations in transcription outputs. You’ll gain hands-on experience with state-of-the-art tools and datasets, and contribute to impactful projects alongside experienced researchers and engineers.

  • Explore and develop techniques to enhance ASR robustness under noisy, low-resource, and domain-shifted conditions.
  • Investigate hallucination phenomena in end-to-end ASR models (e.g., Whisper, Wav2Vec2, etc.) and propose mitigation strategies.
  • Conduct experiments using large-scale speech datasets and evaluate ASR performance across varying noise levels and linguistic diversity.
  • Contribute to publications, technical reports, or open-source tools as outcomes of the research.
Job Role 2: Multimodal AI Intern

Location: Bengaluru, Karnataka, India

Required Qualifications:

Candidates who are pursuing or have completed an MS/MSc/MTech/PhD-level degree in Computer Science, Electronics Engineering, Data Science, Information Science, Artificial Intelligence, Computer Applications, or another closely related technical discipline will be considered for the internship program.

Skills required:

  • Strong knowledge of and relevant programming experience with Python, PyTorch, Scikit-Learn, and other ML and DL libraries, mainly for Multimodal AI tasks.
  • Demonstrable skill in successfully applying state-of-the-art machine learning and deep neural network-based models to multimodal problems that combine audio, textual, and visual data.
  • Detailed understanding of all main network architectures, deployment modes, data augmentation and preparation, and theoretical performance analysis of model architectures.
  • Strong analytical and problem-solving skills with knowledge of algorithms, signal processing, mathematics for machine learning, probability, statistics and linear algebra.
  • Excellent interpersonal skills, with effective communication and presentation skills.

Good to have Skills:

  • Demonstrated research capability through a relevant publication record in leading journals and conferences.
  • Impact-driven mindset and the ability to work and learn in a collaborative and diverse environment.
  • Proficiency with data structures and OOP concepts in Python, C, and C++.
  • Programming experience with Scikit-Image and OpenCV (with Python) and with MATLAB is desirable.
  • Familiarity with the Linux operating system.
  • Participation in Kaggle and other open-source grand challenge competitions.

For this internship program, we are seeking a talented and motivated candidate who is pursuing or has completed an MS/MSc/MTech/PhD-level degree to join us as soon as possible for a duration of at least 6 months.

Only candidates with demonstrable skills and knowledge in Multimodal AI using speech, textual, and vision modalities, and with strong software development skills in PyTorch and Python, will be considered.
