Explore GitLab
Discover projects, groups and snippets. Share your projects with others.
The mw CLI tool (with development environment & gitlab commands)
Collection of data engineering DAGs to be executed by the WMF Airflow instances.
Proof of concept for Terraform and Flink on WMCS
Utilities and libraries for working with data pipelines at WMF. E.g. distributing conda envs, building and syncing artifacts, etc.
Repository for configuration of the general-purpose GitLab Cloud Runner.
A development environment for Scap3 and the Scap self-installer
At the heart of the Section Topics data pipeline lies a set of Spark jobs.
Data jobs owned by the Global Data and Insights team.
These should use conda-dist from workflow_utils to generate conda env artifacts, which are then deployed to the Analytics Data Lake for scheduling and running by Airflow.