We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Edge AI Development Engineer/Researcher

Aleron
United States, Texas, Austin
Jun 05, 2025


Description
The AI team enables state-of-the-art ML and DL model development across our hardware portfolio, using sophisticated model compression techniques to deploy previously impractical AI tasks to battery-powered environments. Our team of data scientists research neural architectures best suited to our customer's needs, select those models most amenable to deployment on our platform, and train them carefully tuning for memory, compute, and energy constraint tradeoffs. Finally, we publish our findings and socialize them via conferences, workshops, and publications.
Beyond a healthy obsession with computational efficiency, the successful candidate will be comfortable with operating in a "version zero' environment, marshaling internal, open source, and third-party resources to solve our customer's problems quickly and elegantly.
Requirements
  • Identify, refine, and/or develop sophisticated ML and DL models for deployment on highly constrained environments.
  • Train models using SOTA compression techniques to fit in specific memory, compute, and power envelopes, making trade-offs between compression and accuracy.
  • Publish and maintain these models in a Model Zoo, including, documentation, and other assets needed by our customers to bootstrap their internal AI features.
  • Socialize their achievements via conferences, meetups, workshops, and publications.
Job Requirements
Required Skills / Qualifications:
Master's Degree
Minimum 2 years experience pruning, distillation, quantization approaches for CNNs and RNNs
Minimum 2 years experience with TensorFlow and Pytorch
Minimum 1 year experience with embedded systems
Preferred Skills / Qualifications:
PhD
  • Experience with one or more of the following AI task domains: audio classification, speech, vision, and/or time series tasks, including domain-specific feature extraction related to those tasks
  • Tensorflow (TFLite, TFLite for Microcontrollers, and/or PyTorch are a plus)
  • Dataset creation and curation
Bonus Qualifications
  • Past "TinyML" involvement or experience
  • Experience developing and optimizing for TFLite for Microcontrollers
  • Experience with embedded C/C++ environments
  • Experience with compression of attention-based architectures
  • Experience with model-to-binary compilers (IREE, MicroTVM, etc)
  • Experience with ONNX, TOSA, Jax, LLVM, and/or MLIR
  • Experience with optimizing for heterogenous AI compute (e.g. CPU+NPU+DSP)
Aleron companies (Acara Solutions, Aleron Shared Resources, Broadleaf Results, Lume Strategies, TalentRise, Viaduct) are an Equal Opportunity Employer. Race/Color/Gender/Religion/National Origin/Disability/Veteran.
Applicants for this position must be legally authorized to work in the United States. This position does not meet the employment requirements for individuals with F-1 OPT STEM work authorization status.

Apply

Applied = 0

(web-696f97f645-sxsds)