Role: Data Scientist with Python and GenAI
Work model: Remote (+ occasional visit to the Warsaw office during client visits)
Rate: 135 - 150 PLN/h netto + VAT
Start: ASAP
Project length: 12 months contracts + extensions
Project language: English, Polish
Workload: Full time
We are a team of creative professionals focused on the development of DATA in its broadest sense. Our expertise covers machine learning, MLOps, Data Engineering, and AI-related projects. We work on ML models, forecasting, and Explainable AI solutions for clients from various industries, including banking, insurance, retail, and more, across Europe, the US, South America, and Asia.
RESPONSIBILITIES:
- Develop innovative solutions using advanced machine learning and AI technologies.
- Model, develop, and validate intelligent assistants leveraging state-of-the-art large language models (e.g., GPT-4, Falcon 2, LLAMA 3, Mixtral), utilizing Retrieval Augmented Generation (RAG) techniques and AI agent frameworks (e.g., Langraph, CrewAI).
- Utilize AI expertise to recommend the best technical approaches and solution architectures for business challenges.
- Communicate complex insights clearly and effectively to both technical and non-technical audiences.
- Collect, preprocess, and analyze large datasets for training ML models.
- Contribute to project documentation and knowledge sharing.
REQUIREMENTS:
- 4+ years of relevant professional experience.
- Hands-on experience in building applications using Large Language Models (LLMs) and related concepts such as RAG, vector databases, embeddings, prompt engineering, and multi-agent systems.
- Familiarity with LLM frameworks (e.g., Langchain, LLamaindex).
- Strong Python programming skills.
- Core competencies in statistics.
- Solid understanding of ML/AI concepts, including types of algorithms, ML frameworks, model efficiency metrics, AI architectures, and model lifecycle.
- Experience in training and using transformer-based models for text and/or image data.
- Expertise in working with large datasets, including data cleaning, transformation, and manipulation (e.g., Pandas, NumPy, SQL).
- A degree in Economics, Econometrics, Quantitative Methods, Computer Science, Mathematics, Physics, Operational Research, or a related discipline.
- Fluency in English (written and spoken).
- Excellent analytical and problem-solving skills.
Nice to have:
- Knowledge of GenAI agent frameworks and Python libraries (e.g., Langraph, CrewAI, Autogen, Taskweave).
- Understanding of Natural Language Processing (NLP) techniques.
- Additional programming language skills (e.g., C#, Go, Java).
- Knowledge of reinforcement learning concepts.