The Role
We are looking for a (Senior-) Data Scientist who is excited to join us in building the first global supplier search. We believe that Data Scientists should accelerate the business through state-of-the-art technology, and collaborate across multiple teams to accomplish that. Your role will focus on building robust NLP models to extract, process, and normalize information from large amounts of unstructured data. You will also have the opportunity to work with our Founders and Data Analysts to understand the core business problems and provide the right solutions.
We mainly work with Python, PostgreSQL, dbt, Airflow, K8s, Terraform, and GitLab CI. We are open to using other technologies and excited to expand our tech stack. In addition, we believe in learners with a growth mindset, so feel free to apply even if you don’t know all of these technologies.
What we offer
- Working in a fast-paced environment with challenging tasks (zero-boredom-guarantee)
- High ownership and the freedom of managing your own projects
- Direct collaboration with the founders – true flat hierarchy
- Rapid professional development & leadership opportunities with a steep learning curve
- Awesome team events and an inclusive company culture with a diverse team
Your responsibilities
You will conceptualize, build, and maintain NLP models for different steps in our data lineage. You will become the owner of one or several data projects and orchestrate them together with Data Analysts / Data Engineers. Your focus will be some of the following:
- Expand our hand-tailored framework for data extraction and processing
- Understand the nuances of our data extraction problems and propose solutions
- Implement solutions using proven State of the Art techniques
- Enable automated data extraction pipelines by training and deploying models on our data infrastructure
- Implement statistical evaluations for every model to keep track of the progress both short and long term
- Prepare technical and non-technical presentations of implemented models
Your experience
- 4+ years of relevant Data Science work experience
- Extensive experience with Python and one of the modern deep learning libraries
- Extensive experience with NLP algorithms, e.g. NER, PoS, Topic Modeling, Sentence Classification, NMT
- Experience in code version control, e.g. git
- Experience with deploying models in production environments
- Experience with small teams in fast-paced environments
- Experience with, or strong interest in real-world, unstructured data of various formats
- Nice-to-have skill sets:
- application containerization, e.g. Docker
- data pipelining or workflow orchestration, e.g. Airflow, Kubernetes
- model monitoring in production
- Bonus: experience in procurement, logistics, or manufacturing
What we value
- Honest, fast, and open collaboration as well as strong communication skills (English)
- Resourceful self-starters who hold themselves and their team to high standards and have attention to detail
- Short release cycles, and active participation in our releases
- Team members who are excited about our mission & tech
If you’re an expert data scientist who loves greenfield projects, we’d love to talk to you!
Alpas is proud to be an equal opportunity employer. We view diversity as a moral imperative and competitive advantage. We are committed to equal employment opportunities regardless of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you have a disability or special need that requires accommodation, please let us know.