Multimodal AI Intern

🏢 Sony Research India 📍 Remote 💰 35k-40k Internship
Internship, 2026 Batch
📅 Posted 2h ago

📄 Job Description

About Sony Research India

Sony Research India is driving cutting-edge research and development globally, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group's diverse businesses in electronics, entertainment, and financial fields. We foster the growth of a diverse pool of research and engineering talent to create a technology talent bank to drive research excellence worldwide. Sony Research India offers outstanding career opportunities around frontline technologies such as AI and data analytics.

About the Role

Internships at Sony Research India (SRI) aim to offer students an opportunity for industry exposure and production-level project experience. The primary responsibility in this role will be to contribute to the growth and development of innovative technologies at SRI. During the internship, you will work closely with research scientists and other team members on various development and optimization tasks in speech, audio, textual, linguistic, and visual signal analysis. In addition, you'll be expected to work on AI development activities daily, including training, testing, fixing bugs, and other routines for building a robust model.

We are seeking a talented and motivated candidate who is pursuing or has completed an MS/MSc/MTech/PhD level degree to join us as soon as possible for a duration of at least 6 months. For this internship program, we will only consider candidates who have demonstrable skills and knowledge in Multimodal AI using audio, text, and vision modalities, with strong software development skills using PyTorch and Python.

Work Location & Duration

  • Work Location: Remote within India.
  • Duration: The paid internship will be for 6 months, ideally starting from the first week of August 2026.
  • Working Hours: 9:00 IST to 18:00 IST (Monday to Friday) full-time.

Eligibility & Qualifications

Candidates who are pursuing or have completed an MS/MSc/MTech/PhD level degree in Computer Science, Electronics Engineering, Data Science, Information Science, Artificial Intelligence, Computer Applications, or other closely related technical disciplines will be considered for the internship program.

Required Skills

  • Strong knowledge and relevant programming experience with Python, PyTorch, Scikit-Learn, and other ML and DL libraries, mainly for Multimodal AI tasks.
  • Demonstrable skills in successfully applying state-of-the-art machine learning and deep neural networks-based models to multimodal problems, including the combination of audio and video technologies.
  • Detailed understanding of all main network architectures, deployment modes, data augmentation and preparation, and theoretical performance analysis of model architectures.
  • Strong analytical and problem-solving skills with knowledge of algorithms, signal processing, mathematics for machine learning, probability, statistics, and linear algebra.
  • Excellent interpersonal skills with effective communication and presentation skills.

Good to Have Skills

  • Demonstrable research capabilities through relevant publication records in leading journals and conferences.
  • Impact-driven mindset and ability to work and learn in a collaborative and diverse environment.
  • Proficient use of data structures and OOP concepts in Python, C, and C++.
  • Programming experience with Scikit-Image, OpenCV with Python, and MATLAB.
  • Familiarity with the Linux operating system.
  • Participation in Kaggle and other open-source grand challenge competitions.