Steffen Mueller

Software Engineer

Tales from a Start-Up Company and its Evolving Architecture

Aug 4, 2023

Strategy
Architecture
Event-Driven Architecture
Cloud Platform

Recently, I have been giving a talk to students at TU Berlin about Cloud-native engineering and software architecture as a part of Prof. Tai’s lecture “Cloud Native Engineering and Architecture”. I explained to the students the approach to software architecture and engineering that my team and me are following. From a 10,000 feet perspective, we are developing an Event-Driven Microservices Architecture to build a customer-first product based on a Cloud-native platform. But there is much more in the background such as the concepts of Minimum Viable Product, Minimum Viable Architecture, etc. that is essential to building such a customer-first product the way we do. This article summarizes the essential points of the talk, connects topics to some of my previous articles, and provides further insights into some core concepts we use.

Introduction

Disclaimer: This article presents my personal opinions and perspectives on a Cloud-native development strategy, software architecture, and engineering. So, this is not necessary my company’s opinion.

In a couple of articles (see, e.g.: this article), I have already explained that I am currently working at a corporate start-up. The company (HUK-Autoserivce) started in 2021 to build an application for booking car services online based on a modern product and development approach such as agile development processes and a Cloud-native platform. In development, we, as nearly every start-up, fight with diverse challenges such as continously changing requirements, a small team implementing these changes and all the new features, an always changing software architecture, a limited budget, etc.

Thereby, my company and its challenges are by far not special. Similar to my company, all start-ups typically have their challenges during the evolution and during different phases of their journey. When a start-up is, for example, in the initial phase of proving its business model, the start-up should focus on using rock-solid technology to build a prototype as fast and as cheaply as possible to prove the business model. When the start-up and the business model evolves, the journey continues and the start-up has to scale the business model, maybe having to handle millions of requests per second or/and maybe in the number of teams building the product. Always, there is the challenge to stay technically ahead of the business and to stay flexible with the organization. Randy Shoup structures this journey of start-ups and their challenges during their evolution nicely based on the business growth s-curve (see also: [1]) in a very good talk [2]. In terms of software architecture and development, it boils down to the meaningful start of his talk: “There is no perfect architecture for all scales, for all phases of evolution, [and] for all problem domains.” [2]

In 2021, my employer has rather started—and is now for sure—in the scaling phase, because the business model has been proven basically earlier. When I have joined and the development of the product has been restarted (see also: this article, we have just slightly adapted the business model to scale. But how does such a start-up in the scaling phase run its business and trying to stay focussed, flexible, fast, and cost-effective?

Back then, we have started to build an Event-Driven Architecture (EDA) based on microservices and are still continuing this approach (see also: this section). Furthermore, we follow a platform strategy (see also: this section). But more important: we clearly follow the idea of Minimum Viable Architecture (MVA) explained in Randy Shoup’s talk [2]. So, let us first dive deeper into the concept of MVA and our approach to it in the next section.

Minimum Viable Architecture and our Approach to it

Software architects and engineers often have to decide how much architectural design they do in the beginning of a project or product or even when developing a new feature. The concept of MVA focusses only on the essential architecture needed to deliver the Minimum Viable Product (MVP). An MVP is a version of a product with just enough features to be delivered to early customers for providing feedback as early as possible for further product development [3]. When following the concept of MVA, you should delay design decisions until they are absolutely necessary to avoid lengthy and unnecessary work. On top, it is better to architect for change in general [4]. The most memorable explanation of MVA is that you concentrate on delivering “just enough architecture” for releasing an MVP [5], [6], [4], [7], [8], or [2].

As already explained via the start of Randy Shoup’s talk [2] in the previous section, there is no perfect architecture that fits all scales, alls phases, and all problem domains. Thus, the MVA approach is, in my believe besides the MVP concept, the most essential lesson to learn for software engineers—especially when you are in a start-up: Just do enough architecture and build for change; do not overengineer your solution. The MVA approach, then, can help to stay focussed, flexible, and, thus, also fast.

How does an Event-Driven Architecture fit into that?

As already mentioned in this section and this article before, we have decided to go with an EDA from a 10,000 feet perspective when starting our journey. We use Domain-Driven Design (DDD) [9] to identify and structure our business domain and the business/domain events for the EDA. Simply, consider the business/domain events as records of business-significant occurrences in a bounded context—when you are not familiar with the term bounded context, simply use the term microservice instead. An EDA, in general, has the following benefits [10] and [11] (see also: this article):

Loose Coupling: Event producers and consumers in an EDA are coupled loosely and communicate asynchronously via an event broker (data format coupling only).
Partial Resiliency: The event broker separates event producers and consumers from each other and, thus, introduces partial resiliency to system parts.
Partial Scalability: Due to loosely coupled and separated producer and consumer components, we can scale our producers and consumers independently.
Single Source of Truth: Our event broker is the single source of truth which keeps domain/business events and allows us to rewrite, combine, extract, rework, … the events to feed other systems and purposes.

Now, you could argue that an EDA does not fit into the MVA approach, because it is rather a complex architecture approach. However, we use the EDA to structure our overall system rather from a 10,000 feet perspective. Think about our EDA rather from the solution or enterprise architecture perspective (see also: Levels of Architecture). On the application architecture level, we use “simpler”/other architecture approaches. We run independently deployable microservices/modular monoliths per team—this topic is probably a very interesting discussion for future articles ;-). In essence, you can think of the EDA as our integration mechanism with the overall system. When needed, we extend our microservices with new such domain/business events. Furthermore, we are able to completely restructure or rebuild the different microservices (internally) as long as we do not change the business/domain events.

So far, this way to work with the MVA approach and the EDA went well and kept us focussed, flexible, and fast—of course there are also challenges to maintain the EDA architecture. For example, defining and raising the events and, then, integrating with other systems is also cumbersome sometimes. But no software architecture is free of any challenges. For more details about the EDA approach, we also refer to this article.

A Cloud-native Platform as a Basis for the Development

According to the Team Topologies approach [12], [13], and [14], we try to structure our development teams into stream-aligned product and platform teams. The latter develops and maintains our Cloud-native platform to build business applications upon it. The platform reduces the complexity for the stream-aligned product teams, so the stream-aligned product teams can focus on dealing with the business complexity (… and, in our case, with the EDA) as well as the product development processes (see also: [15]). The stream-aligned product teams, as part of the business value stream, have end-to-end responsibility for building, deploying, running, supporting, and eventually sunsetting their part of the business or that slice of service. They are cross-functional and consist of front- and back-end developers as well as product owners (PO) and UX/UI designers.

The platform is built as a product and is Cloud-native. It is providing foundational capabilities, basic frameworks, and “experiences”/best practices to facilitate and accelerate the product development of the stream-aligned teams [16]. All in all, you can consider the platform as an intermediate layer between our Cloud provider(s) and the internal customers such as the stream-aligned product teams. The figure below depicts our (current) platform. The basic structure of the platform in the figure is based on the Cloud Native Computing Foundation’s (CNCF) definition of a platform as well as platform engineering in [16].

Our Cloud-native Development Platform

Our basic Cloud providers are Amazon Web Services (AWS), GitHub, Atlassian, and some further providers that are not shown in the figure. This foundational basis is visible at the bottom of the figure.

Using the foundational basis, our platform provides its platform capabilities to be able to provision resources, to provide authentication and authorization services, to deliver Continous Integration and Delivery (CI/CD), etc. The platform capabilities to provision resources are provided via Kubernetes as our container runtime (Amazon Elastic Kubernetes Service; we only run containerized applications so far) as well as Helm. We are able to provision diverse databases (data capabilities) via Amazon Relational Database Service. For the EDA, the platform provides our event broker, Apache Kafka, via Amazon Managed Streaming for Apache Kafka. Last but not least, Keycloak is used to serve authentication and authorization services.

For the platform interfaces, we provide documentation via the bug tracker, Atlassian Jira, and a wiki for documentation (Atlassian Confluence). Technical documentation is also done in readme files in GitHub repositories.

Additionally, we have basic environment and project templates in GitHub repositories as well as Terraform and Helm templates, etc. Our CI/CD bases on GitHub and GitHub Actions. For more information on our CI/CD, we refer to this article. Logging, monitoring, and tracing are delivered via Kibana, Grafana, etc.

For us, this thin platform works well with the stream-aligned product teams to build our applications. As a product, the platform is under continous development and improvement of the platform team. With the best practices “baked into” our platform as well as setup of environments and projects, we are able to come up with new containers/microservices in a couple of minutes and develop our applications fast.

Conclusion

Sometimes we, as engineers, software architects, etc., tend to extend needlessly—or rather overengineer—products and services. The MVP and MVA approaches can help us to concentrate on delivering the necessary things (see also: this section). Especially in the context of a start-up, those concepts are essential, as there are always evolutionary steps and changes you cannot forsee and should not pre-consider because you would waste time and money. For my team and me at a corporate start-up, the MVP and MVA concepts definitely help to stay focussed, flexible, and fast—simply, it prevents us from doing unnecessary work.

To work focussed, flexible, and fast, we rely on an EDA (see also: this section). The MVA is our approach to concentrate on necessary things when running, developing, and driving the EDA. The EDA works as our integration mechanism from a 10,000 feet perspective. When we need new domain/business events, we integrate them into the EDA and connect the other microservices and systems with it—let it be SAP, the new CRM, another microservice, or another system of a provider. On lower levels, our EDA works with microservices that are independently deployable units. Overall, the MVP and MVA concepts are key to our way to work with the EDA as well as our microservices implementing the EDA.

The EDA is running on a Cloud-native platform developed and maintained by the platform team (see also: this section). The Cloud-native platform reduces the complexity to the stream-aligned product teams and helps them to concentrate on implementing business applications upon it. The foundational capabilities, basic frameworks, and “experiences”/best practices of the platform work well in our business context. At the moment, two DevOps/Site Reliability Engineers and an expert for authentication and authorization services are maintaining the platform and serving eight+ developers (front-end and back-end), two PO, and two UX/UI designers—overall, I would say a small but agile, powerful, and cost-effective team in total.

Essentially, this is our way to build and run our application/overall system for delivering car services to you: MVP, MVA, EDA, and a powerful platform.

References

G. P. Boretos, “S-curves and their Applications in Marketing, Business, and the Economy,” Marketing Research Association Alert, Dec. 2012.
R. Shoup, “Minimum Viable Architecture.” Dec-2022.
Wikipedia contributors, “Minimum viable product.” Jul-2023.
P. Pureur, “Minimum Viable Architecture: How To Continuously Evolve an Architectural Design over Time.” 21-Dec-2021.
P. Caroli, “Minimum Viable Architecture (MVA).” 14-Dec-2015.
D. Karanth, “How to Create a Minimum Viable Architecture.” 23-Feb-2016.
M. Erder, E. Woods, and P. Pureur, Continuous Architecture in Practice - Software Architecture in the Age of Agility and Devops. Addison-Wesley Professional, 2021.
K. Bittner and P. Pureur, “A Minimum Viable Product Needs a Minimum Viable Architecture.” 08-Jun-2022.
E. J. Evans, Domain-Driven Design. Addison Wesley, 2003.
G. Hohpe and B. Woolf, “Enterprise Integration Patterns - Introduction to Integration Styles.” 2003.
G. Jansen and J. Saladas, “Advantages of the event-driven architecture pattern - What is event-driven architecture and why use it?” 17-Jun-2020.
M. Skelton and M. Pais, Team Topologies - Organizing Business and Technology Teams for Fast Flow. It Revolution Press, 2019.
M. Skelton and M. Pais, “Monoliths vs Microservices is Missing the Point—Start with Team Cognitive Load.” Sep-2019.
M. Skelton and M. Pais, “Monoliths vs Microservices is Missing the Point—Start with Team Cognitive Load - Team Topologies.” Jul-2019.
M. Schultheiss and D. Gehrke, “Produktorganisationen - Wie erfolgreiche Unternehmen ihre Produktorganisation als unfairen Vorteil einsetzen,” in Digitales Produktmanagement: Methoden – Instrumente – Praxisbeispiele, S. Hoffmann, Ed. Springer Gabler Wiesbaden, 2023.
Cloud Native Computing Foundation, “CNCF Platforms White Paper.” 28-Mar-2023.

Acknowledgements

Huge thanks go to the entire product and development team of HUK-Autoservice for the awesome work.