Research Engineer (LLM Architecture)

Kog · Paris, Île-de-France, France

Location
Paris
Job Type
CDI
Posted
June 19, 2026

Job Description

Description du poste

You will imagine, design and run experiments to understand how architectural decisions propagate through inference behavior, morph existing open-weight models into architecture variants optimized for speed, and turn findings into measurable gains in generation speed and model quality.


Design new model architecture variants, including routing strategies, attention mechanisms, and MoE structure, with execution constraints as a first-order design input.

Extend the Laneformer thesis by exploring inference-aware architectural variants such as DTP, Ladder Residual, and PT-Transformer, and finding what compounds at scale.

Own the post-training pipeline across fine-tuning, evaluation methodology, and adaptation of existing open-weight models toward architecture variants optimized for inference speed.

Scale the stack to large MoE models such as DeepSeek v4 and Qwen 3, working through routing, expert parallelism, and comm...

Ready to Apply?

Submit your application for Research Engineer (LLM Architecture) at Kog

Apply Now