May 21, 2024


First, we’ll push a message with the article URL to a Pub/Sub subject (a message queue). The recommender service pulls the message from the queue to be able to course of it. Earlier than this service can advocate a ballot, it wants to finish just a few steps, which we’ve separated out into particular person companies. The recommender service sends a request to those companies one-by-one and shops the leads to a Redis retailer. These are the steps: 

  1. The article scraper service scrapes (downloads and parses) the article textual content from the URL.

  2. The encoder service encodes the textual content into textual content embeddings (we use the common sentence encoder).

  3. The model security service detects if the article textual content contains descriptions of tragic occasions, equivalent to loss of life, homicide, or accidents, as a result of we don’t need to add our polls into these articles. 

With these three steps accomplished, the advice service can advocate a ballot from our database of pre-existing polls, and submit it to an inside database we name Rec Retailer. That is how we find yourself recommending including a ballot about introducing a velocity restrict on the German Autobahn.

Why we determined to maneuver to Cloud Run

Cloud Run appeared enticing to us for 2 causes. First, as a result of it routinely scales down all the way in which to zero container situations if there are not any requests, we anticipated we’d save prices (and we did!). Second, we appreciated the concept of operating our code on a fully-managed platform with out having to fret in regards to the underlying infrastructure, particularly since our group doesn’t have a devoted knowledge engineer (we’re each knowledge scientists).

As a fully-managed platform, Cloud Run has been designed to make builders extra productive. It’s a serverless platform that allows you to run your code in containers, straight on high of Google’s infrastructure. Deployments are quick and automatic. Fill in your container picture URL and seconds later your code is serving requests. 

Cloud Run routinely provides extra container situations to deal with all incoming requests or occasions, and removes them once they’re now not wanted. That’s cost-efficient, and on high of that Cloud Run doesn’t cost you for the sources a container makes use of if it’s not serving requests. 

The pay-for-use value mannequin was the principle motivation for us emigrate away from GKE. We solely need to pay for the sources we use – and never for a big idle cluster throughout the evening.

Enabling the migration to Cloud Run with just a few modifications

To maneuver our companies from GKE to Cloud Run, we needed to make just a few modifications. 

  • Change the Pub/Sub subscriptions from pull to push. 

  • Migrate our self-managed Redis database within the cluster to a fully-managed Cloud Memorystore occasion. 

That is how our preliminary goal structure seems to be in a diagram: 


Source link