this post was submitted on 30 Apr 2024

15 points (85.7% liked)

Programming

18253 readers

119 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities [email protected]

founded 2 years ago

MODERATORS

[email protected]

How do you holistically document microservices in a multi-repo setup? (programming.dev)

submitted 9 months ago by [email protected] to c/[email protected]

18 comments fedilink hide all child comments

Let's say I had a few microservices in different repositories and they communicated over HTTP using JSON. Some services are triggered directly by other microservices, but others can be triggered by events like a timer going off, a file being dropped into a bucket, a firewall rule blocking X amount of packets and hitting a threshold, etc.

Is there a way to document the microservices together in one holistic view? Maybe, how do you visualise the data, its schema (fields, types, ...), and its flow between the microservices?

Bonus (optional) question: Is there a way to handle schema updates? For example generate code from the documentation that triggers a CI build in affected repos to ensure it still works with the updates.

Anti Commercial-AI license

top 18 comments

sorted by: hot top controversial new old

[–] [email protected] 6 points 9 months ago (1 children)

Generate code from documentation

Let me stop you right there. If you want to generate API bindings, those should be generated from code, along with the documentation, not through it.

[–] [email protected] 1 points 9 months ago (1 children)

Generate API bindings from code? Then what's the code for? Do you have an example?

OpenAPI can generate bindings from their spec, but the spec only seems to describe a single microservice.

Anti Commercial-AI license

[–] breadsmasher 1 points 9 months ago (1 children)

I feel like this is just confusing specification with documentation.

Code generated from a specification, and documented. Swagger using the same specification could maybe sort of be documentation

[–] [email protected] 1 points 9 months ago (1 children)

For a while Peertube solely used OpenAPI to document the project. That's spec, documentation, and code generation in one. Dunno when they switched to a separate documentation tech + OpenAPI, but it's there.

[–] breadsmasher 1 points 9 months ago

Yeah thats more what I was thinking

[–] [email protected] 4 points 9 months ago (1 children)

Diagrams. Loads and loads of diagrams. One for each use-case.

Then I'd have one diagram to draw out dependencies between each service at the broadest level. Although depending on how messy your architecture is it can be very difficult to read, in my experience.

[–] [email protected] 1 points 9 months ago (2 children)

You do this all manually?

Anti Commercial-AI license

[–] [email protected] 2 points 9 months ago* (last edited 9 months ago) (1 children)

More or less. Either Excalidraw for your quick and dirty diagrams or I've used PlantUML + C4 Plug-in for your larger, more long lived diagrams with some success.

[–] [email protected] 1 points 9 months ago

I just gave PlantUML + the C4 Plugin a try and generally liked it, thank you for the rec!

It seems like a good tool although it inherits all the joys and pains of automatic graph layout.

I think I’ll keep it in my arsenal for detailed diagrams that can handle being a little aesthetically wonky.

I hadn’t heard of C4 before and it seems like a solid idea.

[–] [email protected] 1 points 9 months ago

I manually redraw my service architecture because I can create higher quality documentation than when trying to auto-generate it.

But you can get a baseline depending on which Cloud you use. For example, in AWS you can use workload discovery - that generates a system overview.

Bonus (optional) question: Is there a way to handle schema updates? For example generate code from the documentation that triggers a CI build in affected repos to ensure it still works with the updates.

Yes, for example, if your build server exposes the API with an OpenAPI scheme, you can use the build server to generate a client library like a nuget or npn.

Then in the API consumer you can add a build step that checks if there are new version of the client library. Or setup dependabot that creates PRs to update those dependencies

[–] [email protected] 3 points 9 months ago (2 children)

We’re using backstage in combination with openapi. The schema is documented in OpenAPI, but how services are connected is done via backstage, which crawls all repositories and puts it together to form nice graphs we can traverse easily

[–] [email protected] 2 points 9 months ago

That sounds great! I'll look into backstage. It's backstage.io?

Anti Commercial-AI license

[–] [email protected] 1 points 9 months ago* (last edited 9 months ago)

Backstage has become quite misaligned to what we were originally trying to do. Originally, we were trying to inventory and map the service eco-system, to deal with a few concrete problems. For example, when developing new things, you had to go through the village elders and the grape vine to find out what everyone else was doing. Another serious problem was not knowing / forgetting that we had some tool that would've been very useful when the on-call pager went off at fuck you dark thirty.

A reason we could build that map in System-Z (the predecessor of Backstage) is that our (sort of) HTTP/2 had a feature to tell us who had called methods on a service. (you could get the same from munging access logs, if you have them)

Anyway, the key features were that you could see what services your service was calling, who was calling you, and how those other systems were doing, and that you could see all the tools (e.g. build, logs, monitoring) your service was connected to. (for the ops / on-call use case)

A lot of those tool integrations were just links to "blahchat/#team", "themonitoring/theservice?alerts=all" or whatever, to hotlink directly into the right place.

It was built on an opt-in philosophy, where "blahchat/#team" was the default, but if (you're John-John and) you insist that the channel for ALF has to be #melmac, you can have that, but you have to add it yourself.

More recently, I've seen swagger/openapi used to great effect. I still want the map of who's calling who and I strongly recommend mechanicanizing how that's made. (extract it from logs or something, don't rely on hand-drawn maps) I want to like C4, but I haven't managed to get any use out of it. Just throw it in graphviz dot-file.

Oh, one trick that's useful there: local maps. For each service S, get the list of everything that connects to it. Make a subset graph of those services, but make sure to include the other connections between those, the ones that don't involve S. ("oh, so that's why...")

[–] Falst 1 points 9 months ago

If you don’t mind the runtime overhead OpenTelemetry would do the job (with maybe some sort of manual instrumentation for things like timers) and builds a service map.

IMO however if your services are closely tied together then how about grouping them together into one or multiple mono-repositories ? Or at least start designing your bounded contexts so that documenting by hand doesn’t become a maintenance burden.

[–] [email protected] 1 points 9 months ago* (last edited 9 months ago) (1 children)

OpenAPI will let you generate both controller and swagger documentation from a single yaml configuration. That's probably not the whole answer, but it's the hard part. Then you just need something to index all the swagger docs.

This presumes Java. I don't know about other ecosystems.

[–] [email protected] 1 points 9 months ago (1 children)

OpenAPI unfortunately doesn't provide an overview of the different microservices. For example I won't see that $ServiceConsumer is a consumer of messages from $ServiceProducer. It only gets more complicated the more microservices are added.

It could be part of the code generation solution, but I do wonder if OpenAPI is the only solution out there.

Anti Commercial-AI license

[–] [email protected] 2 points 9 months ago (1 children)

I would create a Jenkins task that runs during deployment that does whatever magical thing that updates your central index. That's going to be implementation-dependent. I once worked on a custom workflow and documentation repository that did basically this, but I don't have more info because I was only there a few weeks before getting moved to the contract I had actually been hired for. It would've been more complicated because they had api preview docs for things still under development.

Point is it was a custom solution and I'm not aware of an existing product.

[–] [email protected] 1 points 9 months ago

I don't mind using multiple compatible technologies. OpenAPI for services and something else that consumes an OpenAPI JSON or even JSON Schema to connect the different projects together and provide an overview.

Anti Commercial-AI license