Data Engineer
Company: Speak
Location: San Francisco
Posted on: June 2, 2025
Job Description:
About usOur mission is to reinvent the way people learn,
starting with language. We begin by teaching the next billion
people English, Spanish, and French.English is the global language
of business, culture, and communication, and over 1.5 billion
people around the world are actively trying to learn right now.
Others dream of communicating with the half-billion native Spanish
speakers across the globe. The problem is that it's nearly
impossible to learn tospeaka language without constant access to a
speaking partner. Grammar and vocab apps don't really help - you
need to actuallyconversewith someone.Speak is on a journey to fix
this. We're creating an AI-powered experience that replicates the
flow of a conversation,withoutneeding a human on the other end. The
goal is to make it radically more accessible to be able to have
conversations in a foreign language and eventually help hundreds of
millions of people gain fluency who otherwise wouldn't be able
to.We started on this journey over five years ago and we've still
got a long ways to go. We're thoughtfully adding new team members
only when we think they can truly play a big role in our
mission.Speak launched first in South Korea where we have quickly
grown to become the top grossing education app in the country. We
have now delivered this winning product to more than 40 countries
globally and are continuing to expand to more markets in the coming
months. The company is well funded, and as of December 2024, we've
reached a $1B valuation with our Series C round, through key
partners like Accel, OpenAI, Founders Fund, Y Combinator, Khosla
Ventures, Lachy Groom, Josh Buckley, and more. We're a team of more
than 90 based throughout San Francisco, Seoul, Tokyo, Taipei, and
Ljubljana.About this roleAs a Data Engineer at Speak, you'll play a
pivotal role in shaping the future of digital language learning,
propelling us towards our mission of making language proficiency
accessible to millions worldwide.Your responsibilities will span
the crucial intersection of data infrastructure and analytics, from
managing scalable data pipelines, to deploying sophisticated
analytics solutions that drive personalized learning experiences.
You'll work closely with our product, engineering, and analytics
teams to ensure that our platform, powered by cutting-edge
technology, is not only robust but also delivers actionable
insights that enhance user engagement and learning outcomes.What
you'll be doing
- Design and Build Data Infrastructure: You'll architect and
implement robust, scalable data pipelines using Airflow for
orchestration and dbt for transformation that ensure efficient data
flow and processing. Your work will be critical in managing the
ingestion, storage, and accessibility of data from various sources,
ensuring our platform's backbone is strong and reliable.
- Enable Data-Driven Decisions: By collaborating with
cross-functional teams, you will develop and deploy tools and
frameworks that facilitate data access and analysis, empowering
product and business teams to make informed decisions.
- Optimize Data Architecture: Constantly evaluate and refine the
data architecture to support our growing data needs and ensure
optimal performance. This includes managing a data warehouse and
various data sources, as well as implementing best practices for
data modeling, data quality, and data governance.
- Support Machine Learning Projects: Work closely with analysts
and machine learning engineers by providing them with clean,
structured data for building and deploying predictive models that
enhance personalized learning experiences and engagement
strategies.
- Innovate and Experiment: Stay ahead of the curve by researching
and implementing cutting-edge technologies and methodologies in
data engineering and analytics.
- Collaborate Across Teams: As a key player in the engineering
team, you'll work closely with product managers, analysts, and
other engineers to bring data-driven products and features from
concept to launch.What we're looking for
- Data Modeling: Deep understanding of big data warehouses
(BigQuery, Snowflake, Redshift), theories, principles, and
practices. Ability to design, implement, and manage data warehouses
effectively.
- Programming Skills: Strong programming skills in Python and
SQL. Ability to write efficient, reliable, and maintainable
code.
- Data Pipeline and ETL Development: Experience in building and
optimizing data pipelines, architectures, and datasets. Familiarity
with ETL (extract, transform, load) processes and tools.
- Big Data Technologies: Experience with end-to-end data platform
beyond creating pipelines, such as data ingestion, reverse ETL,
visualization, data observability, etc.
- Cloud Computing: Knowledge of cloud services (GCP, AWS, dbt)
and understanding of how to leverage them for data processing and
storage solutions.
- Data Analysis and Visualization: Ability to analyze data to
identify patterns, anomalies, and insights. Proficiency in using
data visualization tools (e.g. Mode) to communicate findings
clearly.
- Debugging Skills: Strong problem-solving skills and the ability
to approach complex challenges methodically including data
inconsistency issues.
- Effective Communication: Ability to communicate technical
information to non-technical stakeholders clearly and effectively.
This includes writing documentation, presenting findings, and
collaborating on projects.Office
- San Francisco, CAWhy work at Speak
- Join a fantastic, tight-knit team at the right time:we're
growing very quickly, we've most recently raised our Series C from
some of the top investors in the valley, and we've achieved
product-market fit in our initial markets. You'd join at a magical
time when a single person could significantly change the course of
the company.
- Do your life's work with people you'll love working with:we
care strongly about our craft and want every person at Speak to
feel like they're growing every day. We believe in the idea that
working with people you both enjoy and have respect for makes
everything better. We hire thoughtfully and only work with people
we admire deeply.
- Global in nature:We're live in over 40 countries and launching
in a number of new markets soon. We have dedicated offices in San
Francisco, Ljubljana, Seoul, and Tokyo, and you'll have the
opportunity to talk to users in each of these regions on a regular
basis as well as travel.
- Impact people's lives in a major way:Learning a language is one
of the single most life-changing skills one can learn, and right
now 99% of people never achieve their goal because the process is
broken. We're helping millions of people achieve their goals and
improve their lives.Speak does not discriminate based upon race,
religion, color, national origin, gender (including pregnancy,
childbirth, or related medical conditions), sexual orientation,
gender identity, gender expression, age, status as a protected
veteran, status as an individual with a disability, or other
applicable legally protected characteristics.
#J-18808-Ljbffr
Keywords: Speak, Rohnert Park , Data Engineer, Engineering , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...