Location
Toronto
Job Type
Full-time
Posted
June 04, 2026
Job Description
Software Engineer – Inference Serving at Taalas
At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are building a team of hands‑on technologists who dislike overspecialization and seek to excel in both depth and breadth. In this position, the successful candidate will build software infrastructure for an inference serving cluster built around Taalas hardcore AI model chips.
Job Responsibilities
Adapt open‑source inference servers like vLLM and Punica to interface with Taalas’ hardcore AI models
Implement a highly efficient LoRA swapping solution for multi-{tenant,LoRA} environments
Build and test a scalable inference serving cluster using Kubernetes and Traefik or similar
Qualifications
Bachelor’s or higher degree in Computer Science...