arxi is a personal ai agent that lives inside your messenger. one agent, one user, one isolated micro-vm that reads, decides and acts on your behalf. we're a tiny team that ships to production multiple times a day. our product holds people's real data, calendars, credentials and browsers, so security is not a feature, it's the substrate.

there is no staging theater between a commit and production, and that only works if quality is engineered, not hoped for. we're looking for a qa engineer who can break things by hand and in code today, and who wants to build the qa function from zero: the strategy, the harnesses, the agent-driven test fleets, and the bar that keeps a same-day-deploy culture safe. this is a founding qa role. you won't inherit a test suite or a process; you'll define them. and because our product is an autonomous ai agent, testing here goes beyond deterministic asserts: you'll evaluate non-deterministic agent behavior, accuracy, and regressions, and you'll wield ai agents themselves as testers.

how we work

our stack

typescript (grammy, fastify), python (fastapi), next.js with trpc and prisma, gemini via vertex ai behind our own llm proxy, firecracker micro-vms (one per user), self-hosted linux (hetzner, nginx, systemd), sqlite, prometheus and grafana, polar for payments. our tooling is heavily automated and our deploy pipeline is fast, so engineers spend their time on the hard problems rather than the plumbing.

in this role you will

you might be a great fit if you have