Modern C/C++ programming experience
Multi-core/multi-threaded programming experience
Familiarity with deep learning networks: CNN, LSTM, and RNNs
Embedded systems programming experience with basic understanding of computer architecture
Experience in GPU programming/compiler/backend CUDA/OpenCL/ROCm or other deep learning hardware accelerators.
Experience in parallel computation
Creating AI framework for custom chip
Delivering highly optimized, documented, clean and maintainable C/C++ code
Suggesting technical and functional improvements to add value to the product
Our client is an innovative startup from Silicon Valley which is operating in stealth-mode building next-generation semiconductor chips targeted for datacenters and future Edge Computing deployment.
Our customers usually range from startup to high growth and VC backed companies, which drives a culture of acceleration and innovation. We are sure that team extension is the only engagement model that works best. You are looking for your next big thing, aren’t you? So are we, hoping it’s you! Jump at your chance! You will have a unique opportunity to participate in the one of the most innovative projects to build a chip for AI acceleration
You will have a chance to:
Work in the international team for next generation of AI acceleration chips
Deep dive into AI solutions
Grow your experience in writing highly optimized code