Job Description
Join Our Team as a Software Engineer – Data Engineering
At Catawiki, we celebrate the extraordinary every day. As a Software Engineer in Data Engineering, you will play a crucial role in building a strong data foundation that supports our growing global marketplace.
About the Role
Data is central to our decision-making at Catawiki. It fuels our commercial strategies, analytics, machine learning, and marketing efforts. In this role, you will help ensure our data infrastructure is robust and scalable, allowing teams to innovate with confidence. You will collaborate with Machine Learning Engineers, Platform Engineers, and Backend Engineers to enhance our data ecosystem.
What You'll Do
- Build and Scale Data Pipelines: Develop reliable batch and streaming pipelines to ingest data from internal systems and third-party sources into our data warehouse.
- Empower Data Science and AI: Maintain and improve the tools used by Data Scientists for analysis, experimentation, model training, and deployment.
- Protect Data and Privacy: Ensure secure data storage and consistent application of governance, access control, and privacy standards.
- Run and Evolve the Data Platform: Maintain the infrastructure for our data tools and applications, ensuring it is scalable, stable, and cost-effective.
- Own Core Data Tooling: Self-host and operate essential data engineering tools like Airflow and Airbyte on Kubernetes.
- Keep the Lights On: Provide operational support to ensure smooth and reliable functioning of pipelines, platforms, and tools across the business.
What We're Looking For
- Experienced Data Engineer: You have 3+ years of hands-on experience building and operating data systems in production.
- Strong in Python, SQL & Data Integration: You are fluent in Python and SQL, with experience in data integration tools like Fivetran and/or Airbyte.
- Infrastructure & DataOps Minded: You have experience with CI/CD, Infrastructure as Code (e.g., Terraform), and modern DataOps practices.
- Cloud & Platform Savvy: You have worked with cloud platforms (GCP is a plus) and are familiar with our data stack components such as BigQuery, PubSub, DataFlow, GKE, Airflow, Airbyte, FastAPI, and Prometheus.
- Comfortable with Streaming & Scale: You have experience with streaming pipelines using technologies like Kafka, Pub/Sub, Dataflow, or Apache Beam.
- Curious, Collaborative & Privacy-Aware: You are eager to learn new tools, support data platform and machine learning initiatives, and understand the importance of data privacy and GDPR.
Where You’ll Be
This role is based in Portugal (Lisbon) with a hybrid work arrangement (at least 2 days in the office per week).
What We Offer
- Create a visible impact: Work at scale in a global organization serving millions of customers across 80+ categories.
- Learn and grow: Benefit from our Learning & Development initiatives, including clear development plans and mentorship programs.
- A culture of connection: Join a diverse team of 800+ Catawikians from 60+ nationalities in an inclusive and welcoming environment.
- Celebrate life’s moments: Receive a €100 Catavoucher upon joining, a €50 Catavoucher on your birthday, and an extra day off each year to “Pursue Your Passion.” Additional leave is available for key work anniversaries and important life events. Benefits may vary by location.
Our Offices and Way of Working
Our offices in Amsterdam, Paris, and Lisbon are designed to inspire collaboration. Most Catawikians work in a hybrid setup, combining office and remote work, with a minimum of two days per week in the office, unless stated otherwise.
Interested?
If this sounds like the right opportunity for you, please apply with your CV and a cover letter in English. By applying, you agree to Catawiki’s Applicant Privacy Policy. If you’re excited about this role but don’t meet every requirement, we encourage you to apply anyway. You might be just the right candidate for this or other roles.
