What will you do?
- Quantize, prune, distill, finetune Gen AI models to optimize for edge platforms
- Fundamentally understand Amazon's underlying Neural Edge Engine to invent optimization techniques
- Analyze deep learning workloads and provide guidance to map them to Amazon's Neural Edge Engine
- Use first principles of Information Theory, Scientific Computing, Deep Learning Theory, Non Equilibrium Thermodynamics
- Train custom Gen AI models that beat SOTA and paves path for developing production models
- Collaborate closely with compiler engineers, fellow Applied Scientists, Hardware Architects and product teams to build the best ML-centric solutions for our devices
- Publish in open source and present on Amazon's behalf at key ML conferences - NeurIPS, ICLR, MLSys.
We are open to hiring candidates to work out of one of the following locations:
Sunnyvale, CA, USA
Basic Qualifications
- 3+ years of building machine learning models for business application experience
- PhD, or Master's degree and 6+ years of applied research experience
- Experience programming in Java, C++, Python or related language
- Experience with neural deep learning methods and machine learning
Preferred Qualifications
- Experience with modeling tools such as R, scikit-learn, Spark MLLib, MxNet, Tensorflow, numpy, scipy etc.
- Experience with large scale distributed systems such as Hadoop, Spark etc.