YTC Logo
Home icon
Cases
Data Lake for European Newsroom

Data Lake for European Newsroom

Data Lake  for European Newsroom
BIDataAWS

We developed a comprehensive Data Lake solution for a non-profit organization aggregating traffic data to support news agencies' networks in Central and Eastern Europe. This advanced platform streamlines data collection, validation, and analysis, eliminating reliance on manual workflows like Google Sheets

Industry

Media & News

Technologies:

Google Cloud
AWS
Apache
Docker
GitLab
AirFlow

Business Impacts

  • Reduced manual effort for data compilation by over 50%, increasing operational efficiency
  • Improved data accessibility and sharing across news agency partners
  • Enabled partners to analyze and act on traffic trends in real time

Technical Highlights

  • ETL Pipelines: designed scalable and automated pipelines with Airflow for efficient data ingestion and processing
  • Custom BI Dashboards: created interactive dashboards using Superset, enabling better decision-making
  • Cloud Migration: transitioned from Google Cloud to AWS for improved scalability and reliability
  • Post-Production Support: ensured long-term maintenance and functionality of the platform
Additional image 1
Additional image 2