Skip to content
View ulmentflam's full-sized avatar

Highlights

  • Pro

Block or report ulmentflam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ulmentflam/README.md

Evan Owen - AI Researcher & Engineer

Typing SVG

LinkedIn Blog Portfolio Email


Hey, I'm Evan

AI researcher and systems engineer. A decade in production -- distributed systems, blockchain, and LLM infrastructure.

Most recently Co-Founder & CTO at QWERKY AI -- distilled 70B-parameter LLMs into 3B-8B hybrid models on 24 H200 GPUs, pending patent on the attention architecture.

Pursuing an MS in Computer Science at Georgia Tech. BS CS, Summa Cum Laude, University of South Carolina.

Current Research

  • Novel attention architectures -- custom CUDA kernels for hybrid LLM distillation (pending patent)
  • State Space Models in MAX -- Mamba SSM architecture: selective scan, causal conv1d, and RMSNorm kernels in Mojo

Tech Stack

Languages

C++ CUDA Python Go Rust Mojo Swift Solidity Nix

AI / ML

PyTorch Hugging Face DeepSpeed vLLM TensorFlow ROCm

Infrastructure

Kubernetes Docker Terraform AWS GCP

Featured Work

Nightly
Turns your coding CLI (Claude Code, Codex, Cursor, Gemini) into a self-directed, drainable session that lands review-shaped PRs by morning.
Python Stars

Corpus Forge
Syncs a local filesystem with a SQL database and vector embeddings -- storage and serving for sovereign AI models.
Python Stars

Autosentry
Self-healing supervisor for long-running processes -- watch a command, catch the failure, fix it, leave a paper trail.
Python Stars

Modular MAX Framework
Mamba SSM architecture with custom selective scan, causal conv1d, and RMSNorm kernels in Mojo.
Mojo CUDA Stars

QWERKY AI
Distilled 70B→3B-8B hybrid models on 24 H200 GPUs. 4x inference throughput, 1M token context. Pending patent on novel attention architecture.
Python CUDA C++

Pulley
iOS Maps-style drawer library with 2k+ stars, created at 52 Inc.
Swift Stars

Latest Writing

Read more on the QWERKY AI blog →

GitHub Stats

GitHub Stats Top Languages

Contribution Snake


Profile Views

Pinned Loading

  1. modular/modular modular/modular Public

    The Modular Platform (includes MAX & Mojo)

    Mojo 26.3k 2.8k

  2. 52inc/Pulley 52inc/Pulley Public

    A library to imitate the iOS 10 Maps UI.

    Swift 2k 262

  3. corpus-forge corpus-forge Public

    Corpus forge syncs a local file system with a SQL database with vector embeddings for storing and serving sovereign AI models.

    Python 1

  4. nightly nightly Public

    Work while you sleep — a host-native autonomous coding agent that lands draft PRs by morning.

    Python 3

  5. autosentry autosentry Public

    Self-healing supervisor for long-running processes — watch a command, catch the failure, fix it, leave a paper trail.

    Python 2

  6. key-gen key-gen Public

    A BIP-0044 compatible key generator for multiple blockchains. This is not a secure method, but is useful for quick key generation.

    Go 1