Skip to Content

Machine Learning

2 posts

The rise of nanoservices

The rise of nanoservices

While microservices are motivated by domain driven design, which defines bounded business domains and ultimately manifests itself via loosely coupled services that can be independently updated, nanoservices are a response to the needs of machine learning engineering teams to manage an increasing number of models that act as a single

An inventory of transformers inference optimisation methods in the HuggingFace echosystem

An inventory of transformers inference optimisation methods in the HuggingFace echosystem

Latency is one of the main challenges to making machine learning impactful for an organisation. Depending on the latency requirements and the inference methods, the emphasis on latency can be either about cost efficiency and / or about scalability. Machine Learning Engineering teams need to be able to provide value and