Published in Geek Culture · Jun 19 · Building a Data Lake and Warehouse on GCP · From theory to practice, main considerations and GCP services · This article won’t be technically deep. We will talk about Data Lake and Data Warehouse, the important principles which should be considered and which GCP services could be used. I will guide you through the GCP services which could be used to build them and why you might consider them. Before going… · Google Cloud Platform · 11 min read
Oct 16, 2021 · Branching in Airflow workflows · How to avoid cascading skip of tasks · As the complexity of our Airflow DAGs grew, our workflows started to have multiple branches. Simple cases might be implemented with custom checks; more complex ones require utilizing the Airflow API. The latter has some non-obvious behavior, which I’m going to describe in this article, and how to… · Airflow · 3 min read
Jun 2, 2021 · Thoughts about breaking silos of software engineering teams · Or don’t break silos, but push out the silo mentality. · The problem of silos in organizations has troubled minds for ages. You can find a lot of thoughts about this topic on the Internet. They all share common ideas and suggestions about overcoming the problem with silo teams. … · Teamwork · 6 min read
Published in Geek Culture · May 31, 2021 · Python language through .NET developer eyes · A list of useful code snippets and topics about general Python use scenarios which are worth remembering · If you start learning Python after being a C# developer for quite a long time, you might want to have the same toolset which you had in C#. Here I’d like to give a kind of mapping from C# to Python, although I should admit it’s not an accurate… · Python · 9 min read
Mar 20, 2021 · Why should you consider Airflow for your data platform? · Finalizing three weeks of research & development in our team, I’d like to tell more about the criteria which led us to choose Airflow among other interesting platforms for workflow orchestration. · On the way to building a process of data transition, scanning, wrangling and cleaning, it’s essential to have an easy learning path and be able to address orchestration of workflows and their releasing as separate procedures. Since this is a new component in our ecosystem, we need to make sure… · Apache Airflow · 5 min read
Jan 4, 2021 · Applying data retention in MongoDB when using Siddhi · Working around data representation in Siddhi to apply a MongoDB index with TTL · Siddhi doesn’t support a date/time type and can represent date/time values as strings and big integers. In systems which use date/time values to apply TTL logic, it’s not possible to use Siddhi without workarounds. For example, in MongoDB it’s possible to create indexes with TTL settings which make entries expire… · Siddhi · 3 min read
Dec 6, 2020 · Pillars of a Great Team · A lighthouse to improve team collaboration · The year 2020, the worst ever (according to Time magazine), is coming to its end and we need more positive news. Let’s remember what positive communication and positive thinking are, which traits communication benefits from and which traits have a destructive influence. The rest of the article will be based on… · Collaboration · 5 min read
Nov 15, 2020 · Using Feature Flags and Externalized configuration .NET Core app with Flagsmith · You can use config-maps, attach volumes to containers in Docker or Kubernetes to substitute files, or you can use environment variables to extract configuration from applications. But these are dirty hacks in comparison to what you can do if you had a service which provided a configuration for… · Feature Toggles · 6 min read
Sep 28, 2020 · Decreasing dependency on DevOps and minimizing time efforts on building of efficient CI/CD process · When your DevOps are busy and developers are blocked, waiting for a new or updated CI/CD pipeline for a new project, you might start looking for a solution which would save you time and effort on automation. In this article I’ll tell you how we established the CI/CD process for… · Docker Compose · 10 min read
Sep 12, 2020 · Use boilerplate solution with “dotnet new” and stay DRY · In this article I want to share a small technique which saves time on repetitive tasks of creating similar projects. In my project there are multiple types of projects: a CLI tool, a microservice, a data pipeline, a job. The programming language might be any, but we often use C# and… · Boilerplate · 4 min read