Job Details:
Job Description:
Attention and KVCache are the core LLM component. Reduce computation and storage in attention and KVCache will boost inference performance on HW platform. It is critical to explore the various algorithm alternatives to build our technical strength on Xeon as HN and AI appliance solution. We are looking for passionate Intern talent to research and develop sparsity attention and KV Cache compression algorithms for LLM in Intel Platform.
Qualifications:
� Master's or Ph.D. student in Computer Science, Artificial Intelligence, Software Engineering, or related fields
� Strong background in deep learning, LLMs, or NLP
� Familiarity with model compression techniques (e.g., quantization, pruning)
� Proficiency in Python or other programming languages
� Passion for technology and innovation, with a strong drive to explore and push boundaries
� Experience with LLMs (e.g., Mistral, LLaMA, Qwen) is a plus
� Knowledge of hardware acceleration (e.g., GPU, TPU, or custom AI accelerators) is a good plus
� At least 3 days per week, able to commit to a one-year internship
Location: Shanghai
Job Type:
Student / Intern
Shift:
Shift 1 (China)
Primary Location:
PRC, Shanghai
Additional Locations:
Business group:
The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.