Senior Staff AI Researcher (Neural Network & LLM Optimizatio …, City of Sydney
-
City of Sydney, Australia
-
Posted: less than a week ago
-
Save
- Inference & Compute Optimization: Design and implement highly optimized inference pipelines and computational kernels to accelerate LLM and neural network workloads, leveraging low‑level techniques such as SIMD vectorization, cache‑aware memory access patterns, and hardware‑specific tuning.
- Neural Network Compression & Model Optimization: Research and implement pruning, quantization, and other compression techniques to reduce model size and accelerate inference while preserving accuracy. Apply both in‑training and post‑training optimization methods across LLM and vision model workloads.
- Profiling & Observability: Build and utilize advanced profiling tools to identify bottlenecks across the inference and training stack—from memory bandwidth and cache utilization to CPU‑side data preprocessing stalls and end‑to‑end pipeline throughput.
- Evaluation & Benchmarking: Design and maintain rigorous evaluation and benchmarking frameworks for systematic model comparison across optimization configurations. Develop automated pipelines (e.g., LLM‑as‑a‑judge) to measure the impact of optimization techniques on model quality and performance.
- Mentorship: Act as a technical lead for engineers and researchers, fostering a culture of high‑performance code, rigorous benchmarking, and research‑to‑production excellence. Drive team growth, technical interviews, and cross‑functional collaboration. Required Qualifications
- Deep Systems Expertise: 8+ years of experience in high‑performance computing, AI systems, or low‑level software optimization. Deep familiarity with performance‑critical development including CPU/GPU architecture, memory hierarchies, SIMD/vectorization, and profiling‑driven tuning.
- LLM & NN Optimization Track Record: Proven experience optimizing neural networks and LLMs through techniques such as pruning, quantization, and inference acceleration, with a demonstrated path from research to production deployment.
- Communication: Ability to translate complex systems‑level constraints and optimization trade‑offs into actionable research directions for modeling and engineering teams.
- Experience building evaluation frameworks, ML observability, or developer tools that help researchers understand and compare model performance across optimization configurations.
- A history of working on neural network compression, inference acceleration, or applied AI research problems that required bridging algorithmic research with high‑performance implementation.
- Patent authorship or published research in AI/ML optimization.
- Experience with C/C++ inference engines, x86 intrinsics, or similar low‑level performance work is a strong plus. #J-18808-Ljbffr Apply on Kit Job: kitjobau.com/job/3qsgk2
-
Company nameGlasswing
-
Job positionSenior Staff AI Researcher (Neural Network & LLM Optimization) (City of Sydney)
Senior Staff AI Researcher (Neural Network & LLM Optimizatio … has been posted in the Sydney Education & Training category on Locanto.
Why not check out other ads in this category, such as Knowledge Manager, Sydney NSW Australia, Teacher Host Families (Current or Retired Educators), Sydney or Educational AI Writing Tool for Students, Teachers & Learners in Sydney. In total, we have 4 ads in Education & Training in Sydney on Locanto classifieds.
You can find the Education & Training category under Jobs. Want something else? Check out the related categories Transportation & Logistics, Retail, Food & Wholesale and Healthcare, Beauty & Wellness Sydney.
Interested in more? Widen your search to view ads in nearby areas of Sydney. This includes Education & Training in Woolloomooloo, Potts Point and Kings Cross. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.