Jobs / Intel / Sparsity Attention/KVCache Compression algorithm Intern
chevron_leftBack
Sparsity Attention/KVCache Compression algorithm Intern
Intel
placePRC, Shanghai
Posted on Intel website on 31 Mar 2025 (19 days ago)
Intel logo

Job Details:

Job Description: 

Attention and KVCache are the core LLM component. Reduce computation and storage in attention and KVCache will boost inference performance on HW platform. It is critical to explore the various algorithm alternatives to build our technical strength on Xeon as HN and AI appliance solution. We are looking for passionate Intern talent to research and develop sparsity attention and KV Cache compression algorithms for LLM in Intel Platform.

Qualifications:

� Master's or Ph.D. student in Computer Science, Artificial Intelligence, Software Engineering, or related fields � Strong background in deep learning, LLMs, or NLP � Familiarity with model compression techniques (e.g., quantization, pruning) � Proficiency in Python or other programming languages � Passion for technology and innovation, with a strong drive to explore and push boundaries � Experience with LLMs (e.g., Mistral, LLaMA, Qwen) is a plus � Knowledge of hardware acceleration (e.g., GPU, TPU, or custom AI accelerators) is a good plus � At least 3 days per week, able to commit to a one-year internship Location: Shanghai

Job Type:

Student / Intern

Shift:

Shift 1 (China)

Primary Location: 

PRC, Shanghai

Additional Locations:

Business group:

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.
chevron_leftBack to Jobs
Intel logo
Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California, and incorporated in Delaware. Intel designs, manufactures, and sells computer components such as CPUs and related products for business and consumer markets.
Websitelaunch
Careerslaunch