Learning gRPC
Victor Martinez
First, what is RPC?
An idea to extend transfer of control and transmission of data from one machine to another.
http://birrell.org/andrew/papers/ImplementingRPC.pdf
note:
At the time, building applications that communicated with a separate machine was difficult and demanded deep expertise, so much so that the work was left to a few designated network experts. One aim of this RPC implementation was to be highly efficient network-wise while being as simple to use as non-remote procedures.
They believed that by providing a simple interface and tool for machine-to-machine communication, implementing distributed applications would become accessible to less expert people.
They also aimed to provide secure communications with RPC. At the time, the protocols used inside their network had no security at all, to the point where passwords were sent as plain text.
The concept dates back to 1976 [1]
[1] WHITE, J. E. A high-level framework for network-based resource sharing. In Proc. National Computer Conference, (June 1976).
http://birrell.org/andrew/papers/ImplementingRPC.pdf
note:
Even back then, they used a tool to auto-generate the client and server stubs:
But the user-stub and server-stub are automatically generated, by a program called Lupine.
Interface Definition Language
struct Phone {
  1: i32 id,
  2: string number,
}

service PhoneService {
  Phone findById(1: i32 id),
  list<Phone> findAll()
}
An example of Thrift, an IDL used in Facebook's RPC framework
https://github.com/facebook/fbthrift
note:
Many IDLs have been developed over time. Mozilla, Microsoft, IBM... and more developed their own internal RPC frameworks with their own IDLs [2]
In the paper mentioned above, they wrote the interface using the Mesa interface modules feature:
This generation is specified by use of Mesa interface modules. These are the basis of the Mesa (and Cedar) separate compilation and binding mechanism [9]. An interface module is mainly a list of procedure names, together with the types of their arguments and results
[2] https://en.wikipedia.org/wiki/Interface_description_language
gRPC is a modern open source high performance Remote Procedure Call (RPC) framework that can run in any environment.
https://grpc.io/
note:
google Remote Procedure Calls
"gRPC was initially created by Google, which has used a single general-purpose RPC infrastructure called Stubby to connect the large number of microservices running within and across its data centers. In March 2015, Google decided to build the next version of Stubby and make it open source. The result was gRPC"
Why a framework?
gRPC dictates how you will build your network interface.
Code is generated for you, batteries included; you only fill in the gaps.
note:
All the underlying details about networking, encoding, and more are handled for you.
It is more of a framework on the server side: servers must use the generated server stub and only need to implement the service interfaces.
Clients use the generated client stub. For them, the gRPC code is less intrusive and feels more like a library.
Some implementations wrap the original C library, some don't.
Built on top of HTTP/2
So we get for free:
- Multiplexing
- Header compression
- Server push
- TLS
note:
Explain multiplexing and server push
4 types of RPC supported
note:
Explain that each of these RPC types can be specified in the Protocol Buffers IDL
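As an illustration, each of the four call types is declared in the IDL by marking the request and/or the response as a stream. A hypothetical EchoService (service and message names are illustrative, not part of gRPC itself) covering all four shapes:

```proto
service EchoService {
  // Unary: one request, one response
  rpc UnaryEcho(EchoRequest) returns (EchoResponse);
  // Server streaming: one request, a stream of responses
  rpc ServerStreamingEcho(EchoRequest) returns (stream EchoResponse);
  // Client streaming: a stream of requests, one response
  rpc ClientStreamingEcho(stream EchoRequest) returns (EchoResponse);
  // Bidirectional streaming: both sides stream independently
  rpc BidirectionalStreamingEcho(stream EchoRequest) returns (stream EchoResponse);
}
```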
Metadata
Key-value pairs of data used to provide additional information about a call.
Implemented using HTTP/2 headers.
https://github.com/grpc/grpc/blob/master/doc/PROTOCOL-HTTP2.md
note:
gRPC metadata can be sent and received by both the client and the server. Headers are sent from the client to the server before the initial request and from the server to the client before the initial response of an RPC call.
The link shown here documents the supported values for metadata.
Can be useful for: Authentication & tracing
And many more features
- Flow control for streaming
- RPC automatic & manual cancellations
- Reflection (Service discoverability & ease debugging)
- Load balancing (Client requests can be load balanced between multiple servers)
- Call retries
- Health checking (Service-specific health checking)
- Interceptors (Middleware for RPCs)
note:
It is important to explain that these features might differ from language to language, since it depends completely on how each of them implements gRPC
- Flow control is a mechanism to ensure that a receiver of messages does not get overwhelmed by a fast sender. Flow control prevents data loss, improves performance, and increases reliability.
- Reflection: explain that we won't go into detail about reflection, but that I believe we should research it more, since it can be useful for a better developer experience.
- Health check: gRPC specifies a standard service API (health/v1) for performing health checks against gRPC servers. An implementation of this service is provided, but you are responsible for updating the health status of your services. It is pluggable, and some languages might not provide it.
Protocol buffers
Protocol Buffers are language-neutral, platform-neutral extensible mechanisms for serializing structured data.
note:
Explain that it is the default binary serialization format supported by gRPC.
It is also developed by Google.
They are a combination of
- The Interface Definition Language
- The compiler that generates code from IDL files
- Language-specific runtimes
- The serialization format
note:
Here we will focus on the IDL and the tooling, we won't focus on the serialization format.
Protobufs as an Interface Definition Language
Defining a service
// service/v1/service.proto
syntax = "proto3";

package service.v1;

import "amend_termination/request/v1/request.proto";
import "amend_termination/response/v1/response.proto";

service PolicyManagementService {
  rpc AmendTermination(amend_termination.request.v1.AmendTerminationRequest)
      returns (amend_termination.response.v1.AmendTerminationResponse);
}
Defining messages
// amend_termination/request/v1/request.proto
syntax = "proto3";

package amend_termination.request.v1;

import "terminate_policy/request/v1/request.proto";
import "google/protobuf/timestamp.proto";

message AmendTerminationRequest {
  string policy_id = 1;
  google.protobuf.Timestamp requested_at = 2;
  google.protobuf.Timestamp interruption_at = 3;
  optional string description = 4;

  oneof reason {
    terminate_policy.request.v1.CustomerTerminateReason customer = 5;
    terminate_policy.request.v1.PrimaTerminateReason prima = 6;
  }
}
The protoc compiler
Compiles .proto files into code.
Supports plugins for different languages.
protoc --proto_path=src --python_out=build/gen src/foo.proto
note:
--proto_path specifies the source directory, --*_out the destination directory, and the rest is the path to your .proto
Buf CLI
- A linter for proto files
- A formatter for proto files
- A system to organize your proto files by workspaces
- A feature to check for breaking changes in your definitions
- A plugin system to compile proto files into multiple formats
- Editor integration
- And more!
note:
Explain that it builds on top of protoc. Be very short here, just mention the tool briefly. It is important because we use it.
Buf CLI
buf format
buf lint
buf breaking --against ".git#branch=master"
Remarkable features of Protocol buffers
- Strongly typed data
- Language and platform neutral
- Compact binary format
- Backward and Forward compatibility
- Support for RPC service definition
note:
Give a short example of why it is backward and forward compatible. Mention tags.
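To make the tag point concrete, here is a hypothetical evolution of the Phone message from earlier. Fields are identified on the wire by their tag numbers, not their names, which is what makes both directions of compatibility work:

```proto
// v1 of the message
message Phone {
  int32 id = 1;
  string number = 2;
}

// v2 adds a field under a new, previously unused tag.
// Forward compatible: a v1 reader simply skips the unknown tag 3.
// Backward compatible: a v2 reader of v1 data sees the default ("") for tag 3.
message Phone {
  int32 id = 1;
  string number = 2;
  string country_code = 3;
}
```

This is also why existing tag numbers must never be reused or renumbered once a message is in production.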
gRPC in the Rust ecosystem
❤️
Tonic
https://github.com/hyperium/tonic
note:
Built on top of Tower, Tonic is a gRPC over HTTP/2 implementation focused on high performance, interoperability, and flexibility.
It has first class support for async/await.
The main goal of tonic is to provide a generic gRPC implementation over HTTP/2 framing.
Codegen tools need to be used to generate the client and server stubs that will encode and decode the binary data and deal with other gRPC features such as streaming.
Features
- TLS
- Load balancing
- RPC cancellation via timeouts
- Request/Response compression
- Bidirectional streaming
- Health check of services
- Interceptors
- Reflection
- Client & Server stub generation
- Extensible via Tower services
note:
These are only a few notable features, it provides more for sure
Generate code from Proto definitions ⚙️
// build.rs
let mut prost_build = prost_build::Config::new();
prost_build.compile_protos(
    &["<path_to_proto_messages>"],
    &["proto"],
)?;

tonic_build::configure()
    .compile_protos(
        &["proto/es_policy_grpc/service/v1/service.proto"],
        &["proto"],
    )?;
note:
First we need to talk about how we generate code from our protobuf definitions.
Expose the generated code as a library
// lib.rs
pub mod policy_service {
pub mod v1 {
include!(concat!(env!("OUT_DIR"), "/es_policy_grpc.service.v1.rs"));
}
}
note:
We need to expose the generated code through our lib.rs
Auto generated services
pub trait PolicyManagementService {
    async fn withdraw_policy(
        &self,
        request: Request<WithdrawPolicyRequest>,
    ) -> Result<Response<WithdrawPolicyResponse>, Status>;
    // ...
}
note:
We get a trait generated from the Protobuf Service definition
Building a server
// main.rs
let server =
    // gRPC server implemented on top of HTTP/2
    Server::builder()
        .add_service(
            // Policy Management server stub
            PolicyManagementServiceServer::new(
                // Implementation of the service
                PolicyManagementServiceImpl::new(application),
            ),
        );

let listener = TcpListener::bind(("0.0.0.0", grpc_port)).await?;
// TcpListenerStream comes from the tokio-stream crate
server.serve_with_incoming(TcpListenerStream::new(listener)).await?;
note:
Simple build of a Tonic Server. We will dive into how to add middleware later.
Highlight the fact that at the end of the day the gRPC server will be listening to a TCP port like any other HTTP2 server.
Building a client
let mut client =
    // Auto-generated client stub
    PolicyManagementServiceClient::connect("http://[::1]:50051").await?;

let mut request = tonic::Request::new(GenerateContractRequest {
    // ..
});

let token: MetadataValue<_> = "Bearer some-auth-token".parse()?;
request.metadata_mut().insert("authentication", token);

let _response = client.generate_contract(request).await?;
note:
What if we wanted to add those headers for every request? Now we talk about interceptors
Interceptors
Interceptors are similar to middleware but with less flexibility. They allow you to:
- Add/remove/check items in the metadata of each request.
- Cancel a request with a Status.
Interceptors in practice
fn check_auth(req: Request<()>) -> Result<Request<()>, Status> {
    match req.metadata().get("authorization") {
        Some(t) if is_valid(t) => Ok(req),
        _ => Err(Status::unauthenticated("No valid auth token")),
    }
}

let svc = PolicyManagementServiceServer::with_interceptor(
    PolicyManagementServiceImpl::new(application),
    check_auth,
);
Health checking gRPC services
Tonic provides a health check service implementing a standard gRPC health checking protocol.
https://github.com/grpc/grpc/blob/master/doc/health-checking.md
note:
A gRPC service is used as the health checking mechanism.
Since it is a gRPC service itself, doing a health check has the same format as a normal RPC.
It has rich semantics such as per-service health status.
The server has full control over the access of the health checking service.
Health service definition
service Health {
  rpc Check(HealthCheckRequest) returns (HealthCheckResponse);
  rpc Watch(HealthCheckRequest) returns (stream HealthCheckResponse);
}
This definition is provided by the official gRPC docs; each language runtime may or may not implement it.
https://github.com/grpc/grpc/blob/master/doc/health-checking.md
Enabling the health service
let (health_reporter, health_service) = health_reporter();
health_reporter
    .set_serving::<PolicyManagementServiceServer<PolicyManagementServiceImpl>>()
    .await;

Server::builder()
    // Add other layers
    .layer(..)
    .add_service(health_service)
    .serve(addr)
    .await?;
note:
Make it clear that we are using the tonic-health crate which doesn't come by default with tonic.
What about more complex middleware? What if we need to also intercept responses?
Let's dive into Tower
Tower
note:
Tower is a library of modular and reusable components for building robust networking clients and servers.
Tonic is built on top of Tower
Its core abstraction is the Service, which we see on the next slide.
It already provides a set of basic reusable services that solve common networking patterns such as timeouts and rate limiting.
Tower service
pub trait Service<Request> {
    type Response;
    type Error;
    type Future: Future<Output = Result<Self::Response, Self::Error>>;

    fn poll_ready(
        &mut self,
        cx: &mut Context<'_>,
    ) -> Poll<Result<(), Self::Error>>;

    fn call(&mut self, req: Request) -> Self::Future;
}
note:
Tower’s fundamental abstraction.
An asynchronous function from a Request to a Response.
The Service trait is a simplified interface making it easy to write network applications in a modular and reusable way, decoupled from the underlying protocol.
It immediately returns a Future representing the eventual completion of processing the request.
The processing may depend on calling other services. At some point in the future, the processing will complete, and the Future will resolve to a response or error.
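To make the shape concrete, here is a minimal sketch, not production code: a local copy of the trait from the slide (so it compiles without the tower crate), a toy service implementing it, and a tiny driver that polls an immediately-ready future so no async runtime is needed. In real code you would use the tower crate and an executor such as Tokio.

```rust
use std::future::{ready, Future, Ready};
use std::pin::pin;
use std::task::{Context, Poll, RawWaker, RawWakerVTable, Waker};

// Local copy of Tower's core trait, reproduced from the slide,
// so this sketch stands alone without depending on the `tower` crate.
pub trait Service<Request> {
    type Response;
    type Error;
    type Future: Future<Output = Result<Self::Response, Self::Error>>;

    fn poll_ready(&mut self, cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>>;
    fn call(&mut self, req: Request) -> Self::Future;
}

// A toy service: uppercases the request string.
struct Shout;

impl Service<String> for Shout {
    type Response = String;
    type Error = std::convert::Infallible;
    type Future = Ready<Result<Self::Response, Self::Error>>;

    // Always ready: this toy service applies no backpressure.
    fn poll_ready(&mut self, _cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>> {
        Poll::Ready(Ok(()))
    }

    // Return an already-resolved future containing the response.
    fn call(&mut self, req: String) -> Self::Future {
        ready(Ok(req.to_uppercase()))
    }
}

// Minimal driver for futures that resolve on the first poll,
// so we can exercise the service without an async runtime.
fn resolve<F: Future>(fut: F) -> F::Output {
    fn noop(_: *const ()) {}
    fn clone(p: *const ()) -> RawWaker {
        RawWaker::new(p, &VTABLE)
    }
    static VTABLE: RawWakerVTable = RawWakerVTable::new(clone, noop, noop, noop);
    let waker = unsafe { Waker::from_raw(RawWaker::new(std::ptr::null(), &VTABLE)) };
    let mut cx = Context::from_waker(&waker);
    let mut pinned = pin!(fut);
    match pinned.as_mut().poll(&mut cx) {
        Poll::Ready(out) => out,
        Poll::Pending => panic!("future was not immediately ready"),
    }
}
```

The decoupling is the point: Shout knows nothing about HTTP/2 or gRPC, yet it has the same shape as a Tonic server stub.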
Layers
pub trait Layer<S> {
    type Service; // This can be a middleware
    fn layer(&self, inner: S) -> Self::Service;
}
note:
Mechanism to layer services. It allows us to wrap a generic service with another one. It can be used to wrap a reusable service which is meant to act as a middleware around another service.
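A standalone sketch of that wrapping (again with local copies of the traits rather than the real tower crate, and with illustrative names): a Layer that wraps any inner service in a call-counting middleware.

```rust
use std::future::{ready, Future, Ready};
use std::task::{Context, Poll};

// Local copies of Tower's Service and Layer traits from the slides,
// so this sketch compiles without the `tower` crate.
pub trait Service<Request> {
    type Response;
    type Error;
    type Future: Future<Output = Result<Self::Response, Self::Error>>;

    fn poll_ready(&mut self, cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>>;
    fn call(&mut self, req: Request) -> Self::Future;
}

pub trait Layer<S> {
    type Service;
    fn layer(&self, inner: S) -> Self::Service;
}

// Inner service: doubles a number.
struct Double;

impl Service<i64> for Double {
    type Response = i64;
    type Error = std::convert::Infallible;
    type Future = Ready<Result<i64, Self::Error>>;

    fn poll_ready(&mut self, _cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>> {
        Poll::Ready(Ok(()))
    }

    fn call(&mut self, req: i64) -> Self::Future {
        ready(Ok(req * 2))
    }
}

// Middleware: counts calls, then delegates to the wrapped service.
struct CountCalls<S> {
    inner: S,
    calls: u32,
}

impl<S: Service<i64>> Service<i64> for CountCalls<S> {
    type Response = S::Response;
    type Error = S::Error;
    type Future = S::Future;

    fn poll_ready(&mut self, cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>> {
        self.inner.poll_ready(cx)
    }

    fn call(&mut self, req: i64) -> Self::Future {
        self.calls += 1; // middleware behavior, then delegate
        self.inner.call(req)
    }
}

// The Layer produces the middleware around any inner service.
struct CountLayer;

impl<S> Layer<S> for CountLayer {
    type Service = CountCalls<S>;
    fn layer(&self, inner: S) -> CountCalls<S> {
        CountCalls { inner, calls: 0 }
    }
}
```

Because CountLayer is generic over the inner service, the same middleware can wrap a Tonic server stub, another middleware, or this toy Double service unchanged.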
Building a layered service
ServiceBuilder::new()
    .timeout(Duration::from_secs(10))
    .layer(OpenTelemetryServerTracingLayer::new_for_grpc())
    .layer(JwtAuthLayer::new(jwks_client, "starsky"))
    .named_layer(StarskyServer::new(starsky_service));
note:
A real example of a layered service from Starsky. Slightly simplified for the sake of the presentation. The flow will be the following: Timeout -> SSRHL -> Tracing -> SSRHL -> Auth -> Starsky service
Now let's dive into real middleware implementations
Authentication Layer
TODO
Tracing Layer
TODO
