Engineering the
Silicon Synapse.

Senior Staff Engineer at Google. Bridging the gap between deep learning at scale and neuromorphic hardware. Architecting systems where software mimics biology.

Leto Hillza Profile
ACTIVE

Core Directives

01. Biomimicry

Nature has had 3.5 billion years to optimize compute. We shouldn't reinvent the wheel; we should translate biology into silicon.

02. Asynchronicity

The world doesn't wait for a clock cycle. Efficient systems must be event-driven, reacting only when information changes.

03. Scalability

A beautiful algorithm is useless if it can't run at Google scale. Reliability and throughput are first-class citizens.

Key Architectures

Ads Infrastructure

Budgeting Optimization System (BOS)

Architected the real-time dynamic allocation engine for Google Ads. Utilized Reinforcement Learning to adjust spend pacing across millions of concurrent campaigns, balancing delivery guarantees with ROI.

R&D / Hardware

Hybrid SNN Edge Runtime

Currently leading the design of a low-power runtime for Spiking Neural Networks. Enabling event-based processing on edge devices, reducing power consumption by 40% compared to standard ANN inference.

NEURAL_ARCHIVE_V1

Declassified Concepts

Hover over data cards to decrypt technical definitions.

SNN

SPIKING NEURAL NET

The "Brain-Like" AI

Standard AI processes data continuously. SNNs only "fire" when a spike occurs—just like biological neurons. This event-driven method drastically cuts energy use.

RL

REINFORCEMENT LEARNING

Trial & Error at Scale

An "Agent" learns to make decisions by interacting with an environment and receiving rewards. This is the logic engine behind my dynamic ad-budgeting systems.

VON NEUMANN

THE BOTTLENECK

The Old Way

Traditional chips separate memory and processing, wasting energy moving data. My work uses Neuromorphic architecture, where memory and compute coexist.

EDGE

LOCAL INFERENCE

Intelligence on Device

Running AI directly on sensors rather than the cloud. This requires extreme efficiency—making Spiking Neural Networks the ideal solution.

Operational History

Senior Staff Software Engineer

Google / Feb 2026 — Present

Neuromorphic Computing & Hybrid AI

  • Integration of spiking neural networks (SNNs) into scalable production systems.
  • Designing neuromorphic algorithms optimized for low-power edge devices.
  • Driving cross-functional efforts for custom AI accelerators (TPU/ASIC).
  • Publishing internal whitepapers on brain-inspired computing paradigms.

Staff Software Engineer

Google / 2022 — 2026

Ads ML Infrastructure

  • Architected and pioneered the Budgeting Optimization System (BOS), leveraging Reinforcement Learning (RL) to dynamically allocate multi-billion dollar ad spends. Developed the reward functions and policy gradients that balanced advertiser ROI with platform liquidity in real-time auctions.
  • Led the design of production-scale streaming pipelines for ad-click telemetry, enabling online learning and sub-minute model freshness. Integrated ScaNN-based retrieval and TFX orchestration to handle trillions of daily events with millisecond-level inference latency.
  • Partnered deeply with Site Reliability Engineering (SRE) to achieve 99.999% availability for the core ads serving stack. Orchestrated global failover protocols and automated 'circuit-breaker' logic to protect revenue integrity during high-volatility market events.
  • Set the multi-year technical roadmap for Ads Infrastructure, driving engineering standards for ML reproducibility and observability. Mentored senior engineers across the organization on the deployment of distributed RL agents in production.

Senior Software Engineer

Google / 2020 — 2022

Workspace (Chat & Drive)

  • Reduced global deployment latency by 60% for Google Workspace by architecting a unified TFX-based CI/CD pipeline, enabling sub-day iteration cycles for production ML models.
  • Scalability lead for Google Drive's collaboration engine; optimized p99 latency for real-time file state synchronization during a 300% surge in global traffic (2020–2021).
  • Enhanced file discovery via semantic retrieval; integrated multi-signal ranking models into the Drive 'Priority' backend, significantly reducing 'time-to-file' for Enterprise users.

Software Engineer

Google / 2018 — 2020

Google Lens

  • Contributed to real-time image recognition features for Google Lens.
  • Architected and optimized end-to-end computer vision and retrieval pipelines for Google Lens, enabling real-time visual discovery for 1B+ users. Leveraged multimodal learning to bridge the gap between pixel-level data and semantic understanding.

Research & IP

PATENT US-110452A Pending

Event-Driven Memory Allocation for Sparse Tensors

WHITEPAPER Google Internal

Optimizing RL Agents for TPUv4 Architecture

READ_WHITEPAPER

Initiate Handshake

Interested in discussing the future of neuromorphic computing or hybrid AI architectures?

Send Signal

System_Dependencies

C++
LOW LATENCY
Rust
MEMORY SAFE
JAX
AUTOGRAD
Verilog
HDL
Torch
DEEP LEARNING
TPUv4
HARDWARE