Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. This project is suited for a Senior Python developer with deep functional testing experience and strong skills in Linux and Docker.
Responsibilities:
- Create functional black-box tests for large codebases in various source languages
- Create and manage Docker environments to ensure 100% reproducible builds and test execution across different platforms
- Monitor code coverage and configure automated scoring criteria that meet industry-benchmark standards
- Leverage LLM-based tools (Roo Code, Claude) to accelerate development cycles, automate repetitive tasks, and improve overall code quality
Requirements:
- 5+ years of experience as a Software Engineer (primarily Python)
- Deep experience with pytest (fixtures, including session-scoped fixtures, and timeouts) and with designing black-box functional tests for CLI tools
- Expert-level Docker skills (reproducible Dockerfiles, user contexts, secure workspaces)
- Strong Linux & Bash scripting skills and comfort debugging inside containers
- Proficiency with modern Python tooling (uv, pyproject.toml, packaging)
- Ability to read and understand code in many programming languages (for example C, C++, Rust, or Go) with the help of an LLM
- Experience using LLM-based coding tools (Claude Code, Roo Code, Cursor) to accelerate iterative development and test-case generation
- English proficiency at B2 level or higher
- Prior experience with agent evaluation platforms and MCP CLI