Foundation Models for Data Research Intern: 2025
- Yorktown HeightsSan JoseCambridgeAlbany
- Research
- Internship
Foundation Models for Data Research Intern: 2025
- Yorktown HeightsSan JoseCambridgeAlbany
- Research
- Internship
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today’s most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.
Your Role and Responsibilities
We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.
Topics of interest include research on interactive orchestration of data workflows such as natural language to data insights spanning multiple tools and functions, knowledge-driven data discovery and querying with graphs and mutli-modal FMs, step-by-step planning and reasoning for complex data workflows , and low-computational cost inference techniques for FMs to efficiently automate or assist users with data tasks.
We are looking for interns with skills and tasks of interest include:
- [LLM for code generation] Research for effective use of foundational models for code generation pipelines specific to data tasks such as SQL for data retrieval
- [Agents and Reasoning] Research for developing novel autonomous agentic systems to compete with Text-to-SQL on public leaderboards like BIRD and Spider 2.0
- [Knowledge Graphs, Multi-Modal FMs] Research for novel ways to combine foundational models, knowledge graphs, and multi-modal data for improving tasks such as data discovery and automated text-to-sql
- [FM Inference] Research for improving foundation models inference in terms of both answer generation and computational cost.
Required Technical and Professional Expertise
- Applicants should be PhD & MS students pursuing graduate studies.
- Pursuing graduate studies in computer science and related fields.
- Having at least one Research publication at a top conference in AI.
- Familiarity and working expertise with large language models.
Preferred Technical and Professional Expertise
- Familiarity with knowledge graphs, RAG, agentic frameworks.
- Familiarity with reinforcement learning, knowledge distillation and prompt optimization.
- Familiarity with SQL.
Want to know what it’s like to be an IBMer?
Key Job Details
Don’t see a fit at this time?
Don’t worry. Join our Talent Network and get notified about the latest opportunities.