Foundation Models for Data Research Intern: 2025
- San Jose, CA, USCambridge, MA, USAlbany, NY, US
- Research
- Internship
Foundation Models for Data Research Intern: 2025
- San Jose, CA, USCambridge, MA, USAlbany, NY, US
- Research
- Internship
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today’s most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.
Your Role and Responsibilities
We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.
Topics of interest include research on interactive orchestration of data workflows such as natural language to data insights spanning multiple tools and functions, knowledge-driven data discovery and querying with graphs and mutli-modal FMs, step-by-step planning and reasoning for complex data workflows , and low-computational cost inference techniques for FMs to efficiently automate or assist users with data tasks.
We are looking for interns with skills and tasks of interest include:
- [LLM for code generation] Research for effective use of foundational models for code generation pipelines specific to data tasks such as SQL for data retrieval
- [Agents and Reasoning] Research for developing novel autonomous agentic systems to compete with Text-to-SQL on public leaderboards like BIRD and Spider 2.0
- [Knowledge Graphs, Multi-Modal FMs] Research for novel ways to combine foundational models, knowledge graphs, and multi-modal data for improving tasks such as data discovery and automated text-to-sql
- [FM Inference] Research for improving foundation models inference in terms of both answer generation and computational cost.
Required Technical and Professional Expertise
- Applicants should be PhD & MS students pursuing graduate studies.
- Pursuing graduate studies in computer science and related fields.
- Having at least one Research publication at a top conference in AI.
- Familiarity and working expertise with large language models.
Preferred Technical and Professional Expertise
- Familiarity with knowledge graphs, RAG, agentic frameworks.
- Familiarity with reinforcement learning, knowledge distillation and prompt optimization.
- Familiarity with SQL.
Quer saber como é ser um IBMista?
About IBM
IBM’s greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.
Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we’re also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.
At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it’s time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.
Detalhes importantes do cargo
Não encontrou uma oportunidade para este momento?
Não se preocupe. Junte-se à nossa Rede de Talentos e receba notícias sobre as últimas oportunidades.