Anna Geller

5.3K Followers


Mar 2

2023 State of Data Infrastructure — Key Trends from Matt Turck’s MAD Landscape

Summary of 2023 data infrastructure from the perspective of a practitioner who talks (a lot) to end users in data communities. Matt Turck recently published the 2023 MAD (Machine Learning, Artificial Intelligence & Data) Landscape overview. Similar graphics were made for 2012, 2014, 2016, 2017, 2018, 2019 (Part I and Part II), 2020, and 2021, and the 2023 landscape is now available as a PDF and as an interactive version.

Data

8 min read



Published in The Prefect Blog

·Feb 1

Should You Measure the Value of a Data Team?

What to measure and whether you should — Data teams are sometimes asked to prove their ROI to senior leadership to justify a budget for new hires, tools, projects, or process changes. But the work of data teams is inherently unmeasurable. Often the reason for this ROI question isn’t rooted in a lack of proper metrics but rather…

Data

7 min read



Jan 15

Giving and Receiving Negative Feedback

How to do it respectfully and in a way that improves your relationships rather than harming them — One of the most challenging aspects of giving and receiving negative feedback is understanding how people can respond to it. While some prefer more direct feedback, others can feel personally attacked by it. You need to know the person on the other side to assess that correctly. …

Feedback

4 min read



Published in The Prefect Blog

·Jan 9

Prefect & Fivetran: integrate all the tools & orchestrate them in Python

Cloud data ingestion and orchestration made easy — What is Prefect? It’s a flexible framework to build, reliably execute and observe your dataflow while supporting various execution and data access patterns. It lets you turn any Python script into a fully operationalized application. What is Fivetran? It’s a data integration platform that allows businesses to replicate data from various systems into a central data…

Data

7 min read



Published in Better Programming

·Jan 5

How To Securely Parse GitHub Actions JSON Secrets for Azure CI/CD

A quick but secure way to extract fields from JSON-based secrets. Several cloud vendors provide credentials as a JSON object. For instance, Google Cloud allows you to create a credentials JSON file that contains, among other things, the project identifier, private key, and client email of a service account. …

Programming

4 min read
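
The teaser above describes credentials delivered as a single JSON object. As a rough illustration only (not the method from the article, which targets GitHub Actions and Azure), here is a minimal Python sketch that extracts the three fields the teaser mentions. The environment variable name `CLOUD_CREDENTIALS_JSON` and the helper `parse_credentials` are hypothetical; the field names `project_id`, `private_key`, and `client_email` are assumed to follow the Google Cloud service account key format.

```python
import json
import os

# Assumed field names, following the Google Cloud service account key format.
REQUIRED_FIELDS = ("project_id", "private_key", "client_email")


def parse_credentials(raw: str) -> dict:
    """Parse a JSON credentials string and return only the required fields."""
    creds = json.loads(raw)
    missing = [name for name in REQUIRED_FIELDS if name not in creds]
    if missing:
        # Report the missing field names only, never any secret values.
        raise KeyError(f"credentials JSON is missing: {', '.join(missing)}")
    return {name: creds[name] for name in REQUIRED_FIELDS}


if __name__ == "__main__":
    # CLOUD_CREDENTIALS_JSON is a hypothetical variable name for this sketch.
    raw = os.environ.get("CLOUD_CREDENTIALS_JSON")
    if raw is None:
        print("CLOUD_CREDENTIALS_JSON is not set")
    else:
        fields = parse_credentials(raw)
        # Print only the non-sensitive fields; the private key stays out of logs.
        print(fields["project_id"], fields["client_email"])
```

Keeping the private key out of any printed output is the main design choice here, since CI logs are often readable by more people than the secret store is.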



Published in The Prefect Blog

·Dec 20, 2022

What I learned from NormConf 2022

Summary of selected talks and lessons learned. NormConf is an online tech conference about things that matter in data and ML but don’t get the spotlight. For something that started as a Twitter joke, NormConf 2022 exceeded everyone’s expectations. It featured many excellent presentations from smart people sharing real-life stories from the field. All talks…

Data Science

8 min read



Published in The Prefect Blog

·Dec 19, 2022

GCP and Prefect Cloud — from Docker Container to Cloud VM on Google Compute Engine

Repository template with GitHub Actions will deploy your first Python dataflows to Google Cloud in minutes — Google Compute Engine allows running virtual machines (VMs) on GCP. It’s a scalable platform for running a wide range of workloads, including custom (Python) applications, data processing, and machine learning — an ideal execution layer for Prefect flows. …

Data Engineering

10 min read



Published in The Prefect Blog

·Dec 8, 2022

Why Prefect

Orchestration is just one part of the dataflow equation — What is Prefect? It’s a flexible framework to build, reliably execute and observe your dataflow while supporting a wide variety of execution and data access patterns. Why should you care? The Modern Data Stack encompasses a large array of highly specialized components. You can find tools for data ingestion, transformation, analysis, validation, cataloging — the list goes…

Data Science

10 min read



Published in The Prefect Blog

·Dec 6, 2022

Schedule & orchestrate dbt Cloud jobs with Prefect

Modular Data Stack with dbt Cloud Prefect block. This short post will walk you through how to set up dbt Cloud jobs and orchestrate them with Prefect. It assumes that you have already signed up for dbt Cloud and know how to use dbt. dbt Cloud setup: First, you need to retrieve the dbt Cloud account ID and create an API…

Analytics Engineering

5 min read



Dec 2, 2022

How to manage data teams, build a reliable platform & ensure data quality

Answers to some of the hardest questions in data. #1 How to manage a data team: Managing a data team can be challenging, as data professionals often have unique skills and expertise that require specialized knowledge to support and manage them. Here are some guidelines to follow. Clearly define roles and responsibilities: each data team member needs to understand their tasks and objectives. This will help…

Data

6 min read



Lead DX Engineer, Data Professional, Cloud & .py fan. www.annageller.com. Get my articles via email: https://annageller.medium.com/subscribe

Following
  • Desiree Peralta
  • Dario Radečić
  • Sinem Günel
  • Roman Orac
  • Allen Helton
