Python AI Engineer
We’re hiring! Join ItexUS today!
We are seeking an experienced Python AI Engineer to develop and optimize systems interacting with large language models (LLMs), including chatbots, agent systems, and Retrieval-Augmented Generation (RAG) architectures. You will work on building high-performance, scalable solutions integrated with cloud platforms and modern AI tools, ensuring rapid development pace and high code quality. The role involves both independent work and collaboration within cross-functional teams.
What you’ll do:
- Develop and optimize backend systems using Python and FastAPI to build APIs integrated with LLMs (OpenAI API, Grok, GPT-5).
- Design and implement RAG pipelines for data processing, vectorization, and answer generation using OpenAI libraries.
- Build and maintain chatbot systems leveraging NLP technologies.
- Develop agent-based systems for text analysis and multimedia generation (audio, video, avatars).
- Integrate with cloud platforms (AWS) using serverless and microservice architectures.
- Work with vector databases (PostgreSQL + pgvector) for semantic search and data handling.
- Ensure code quality through regular code reviews and testing.
- Participate in LLM fine-tuning, prompt engineering, and model optimization for specific use cases (e.g., text summarization).
Requirements:
- 4+ years of Python development experience and 3+ years with FastAPI or Django.
- Strong understanding of asynchronous programming in Python.
- Hands-on experience with LLMs (OpenAI API, GPT-4/4o, Grok).
- Solid knowledge of RAG, prompt engineering, and fine-tuning techniques.
- Familiarity with NLP libraries (e.g., LangChain).
- Experience with containerization using Docker and Kubernetes.
- Strong database knowledge: PostgreSQL and vector databases (e.g., pgvector, ChromaDB, FAISS).
- Cloud experience, particularly AWS (ECS, S3, Lambda).
- Proficient in Git and CI/CD tools (GitLab CI/CD, Azure DevOps).
- English level B2+: ability to communicate confidently with both technical and non-technical stakeholders.
- Strong analytical skills, attention to detail, and the ability to work independently in a fast-paced environment.
- Quick learner with the ability to adapt to new technologies and client-specific tasks.
Nice to have:
- Experience with local LLMs and inference.
- Basic knowledge of frontend technologies (React, Next.js, TypeScript).
- Experience with real-time technologies (WebSocket, Socket.IO, WebRTC).
- Knowledge of MLOps/LLMOps (MLflow, Kubeflow) and vLLM.
- Experience with task queues (Dramatiq, Redis) and caching strategies.
- Familiarity with AWS CDK for infrastructure automation.
- Experience working with multimedia processing (audio/video, avatars, STT/TTS).
We would be delighted to share more information about our company during our meeting.
Job benefits you will appreciate
Modern office in the city center near the metro station
Friendly professional team in a fast-growing company
Competitive salary
Flexible working hours
Professional events & meetups
Fully-stocked kitchen
Corporate library and the book club
Parking for bicycles & electric scooters
Year-round team building events and corporate parties (local & abroad)
