pw_protobuf#

The protobuf module provides an expressive interface for encoding and decoding the Protocol Buffer wire format with a lightweight code and data footprint.

Note

The protobuf module is a work in progress. Wire format encoding and decoding is supported, though the APIs are not final. C++ code generation exists for encoding and decoding, but not yet optimized for in-memory decoding.

Overview#

Unlike protobuf libraries which require protobuf messages be represented by in-memory data structures, pw_protobuf provides a progressive flexible API that allows the user to choose the data storage format and tradeoffs most suitable for their product on top of the implementation.

The API is designed in three layers, which can be freely intermixed with each other in your code, depending on point of use requirements:

  1. Message Structures,

  2. Per-Field Writers and Readers,

  3. Direct Writers and Readers.

This has a few benefits. The primary one is that it allows the core proto serialization and deserialization libraries to be relatively small.

Label

Segment

Delta

Full wire-format proto encode/decode library

FLASH

NEW

+822

pw::protobuf::StreamDecoder::ReadRepeatedFixedField<>()

NEW

+738

pw::protobuf::StreamDecoder::ReadRepeatedVarintField<>()

NEW

+654

pw::protobuf::StreamEncoder::WritePackedVarints<>()

NEW

+228

pw::protobuf::StreamDecoder::ReadFieldKey()

NEW

+224

pw::protobuf::StreamEncoder::WriteLengthDelimitedFieldFromStream()

NEW

+208

pw::protobuf::StreamDecoder::ReadOneVarint()

NEW

+208

pw::protobuf::StreamEncoder::GetNestedEncoder()

NEW

+196

pw::protobuf::StreamEncoder::WritePackedFixed()

NEW

+176

pw::protobuf::StreamDecoder::ReadPackedVarintField()

NEW

+168

pw::protobuf::Decoder::FieldSize()

NEW

+168

pw::protobuf::StreamDecoder::SkipField()

NEW

+120

pw::protobuf::StreamEncoder::CloseNestedMessage()

NEW

+112

pw::protobuf::Decoder::ConsumeKey()

NEW

+110

pw::protobuf::StreamDecoder::ReadPackedFixedField()

NEW

+108

pw::protobuf::StreamEncoder::UpdateStatusForWrite()

NEW

+104

pw::protobuf::StreamDecoder::ReadDelimitedField()

NEW

+100

pw::protobuf::StreamDecoder::GetBytesReader()

NEW

+100

pw::protobuf::StreamDecoder::ReadVarintField<>()

NEW

+98

pw::protobuf::StreamDecoder::ReadFixedField()

NEW

+96

pw::protobuf::Decoder::ReadDelimited()

NEW

+90

pw::protobuf::StreamDecoder::Advance()

NEW

+90

pw::protobuf::StreamDecoder::BytesReader::DoRead()

NEW

+90

pw::protobuf::StreamDecoder::BytesReader::DoSeek()

NEW

+88

pw::protobuf::StreamDecoder::CloseNestedDecoder()

NEW

+84

pw::protobuf::StreamDecoder::ReadVarintField()

NEW

+84

pw::protobuf::StreamEncoder::WriteFixed()

NEW

+80

pw::protobuf::StreamDecoder::Next()

NEW

+78

pw::protobuf::StreamEncoder::WriteVarintField()

NEW

+74

pw::protobuf::Decoder::ReadFixed()

NEW

+70

pw::protobuf::FieldKey::FieldKey()

NEW

+70

pw::protobuf::StreamEncoder::WriteLengthDelimitedField()

NEW

+68

pw::protobuf::StreamEncoder::WriteVarint()

NEW

+64

pw::protobuf::StreamDecoder::CheckOkToRead()

NEW

+60

pw::protobuf::Decoder::FieldNumber()

NEW

+60

pw::protobuf::StreamDecoder::~StreamDecoder()

NEW

+58

pw::protobuf::Decoder::ReadVarint()

NEW

+54

pw::protobuf::Decoder::SkipField()

NEW

+54

pw::protobuf::FieldKey::IsValidKey()

NEW

+52

pw::protobuf::Decoder::ReadSint64()

NEW

+52

pw::protobuf::Decoder::ReadUint32()

NEW

+50

pw::protobuf::Decoder::ReadSint32()

NEW

+48

pw::protobuf::StreamEncoder::CloseEncoder()

NEW

+46

pw::protobuf::Decoder::Next()

NEW

+46

pw::protobuf::WriteVarint()

NEW

+44

pw::protobuf::StreamDecoder::BytesReader::~BytesReader()

NEW

+44

pw::protobuf::StreamDecoder::CloseBytesReader()

NEW

+44

pw::protobuf::WriteLengthDelimitedKeyAndLengthPrefix()

NEW

+40

pw::protobuf::Decoder::ReadBool()

NEW

+40

pw::protobuf::TagSizeBytes<>()

NEW

+38

pw::protobuf::StreamDecoder::GetLengthDelimitedPayloadBounds()

NEW

+36

pw::protobuf::StreamDecoder::BytesReader

NEW

+32

pw::protobuf::Decoder::ReadString()

NEW

+32

pw::protobuf::ValidFieldNumber()

NEW

+22

pw::protobuf::StreamEncoder::WriteZigzagVarint()

+6,720

Including table-based Message encoder and decoder

FLASH

NEW

+1,264

pw::protobuf::StreamEncoder::Write()

NEW

+1,064

pw::protobuf::StreamDecoder::Read()

NEW

+822

pw::protobuf::StreamDecoder::ReadRepeatedFixedField<>()

NEW

+738

pw::protobuf::StreamDecoder::ReadRepeatedVarintField<>()

NEW

+654

pw::protobuf::StreamEncoder::WritePackedVarints<>()

NEW

+228

pw::protobuf::StreamDecoder::ReadFieldKey()

NEW

+224

pw::protobuf::StreamEncoder::WriteLengthDelimitedFieldFromStream()

NEW

+208

pw::protobuf::StreamDecoder::ReadOneVarint()

NEW

+208

pw::protobuf::StreamEncoder::GetNestedEncoder()

NEW

+196

pw::protobuf::StreamEncoder::WritePackedFixed()

NEW

+176

pw::protobuf::StreamDecoder::ReadPackedVarintField()

NEW

+168

pw::protobuf::Decoder::FieldSize()

NEW

+168

pw::protobuf::StreamDecoder::SkipField()

NEW

+120

pw::protobuf::StreamDecoder::GetNestedDecoder()

NEW

+120

pw::protobuf::StreamEncoder::CloseNestedMessage()

NEW

+112

pw::protobuf::Decoder::ConsumeKey()

NEW

+110

pw::protobuf::StreamDecoder::ReadPackedFixedField()

NEW

+108

pw::protobuf::StreamEncoder::UpdateStatusForWrite()

NEW

+104

pw::protobuf::StreamDecoder::ReadDelimitedField()

NEW

+100

pw::protobuf::StreamDecoder::GetBytesReader()

NEW

+100

pw::protobuf::StreamDecoder::ReadVarintField<>()

NEW

+98

pw::protobuf::StreamDecoder::ReadFixedField()

NEW

+96

pw::protobuf::Decoder::ReadDelimited()

NEW

+90

pw::protobuf::StreamDecoder::Advance()

NEW

+90

pw::protobuf::StreamDecoder::BytesReader::DoRead()

NEW

+90

pw::protobuf::StreamDecoder::BytesReader::DoSeek()

NEW

+88

pw::protobuf::StreamDecoder::CloseNestedDecoder()

NEW

+84

pw::protobuf::StreamDecoder::ReadVarintField()

NEW

+84

pw::protobuf::StreamEncoder::WriteFixed()

NEW

+80

pw::protobuf::StreamDecoder::Next()

NEW

+78

pw::protobuf::StreamEncoder::WriteVarintField()

NEW

+74

pw::protobuf::Decoder::ReadFixed()

NEW

+70

pw::protobuf::FieldKey::FieldKey()

NEW

+70

pw::protobuf::StreamEncoder::WriteLengthDelimitedField()

NEW

+68

pw::protobuf::StreamEncoder::WriteVarint()

NEW

+64

pw::protobuf::StreamDecoder::CheckOkToRead()

NEW

+60

pw::protobuf::Decoder::FieldNumber()

NEW

+60

pw::protobuf::StreamDecoder::~StreamDecoder()

NEW

+58

pw::protobuf::Decoder::ReadVarint()

NEW

+54

pw::protobuf::Decoder::SkipField()

NEW

+54

pw::protobuf::FieldKey::IsValidKey()

NEW

+52

pw::protobuf::Decoder::ReadSint64()

NEW

+52

pw::protobuf::Decoder::ReadUint32()

NEW

+50

pw::protobuf::Decoder::ReadSint32()

NEW

+48

pw::protobuf::StreamEncoder::CloseEncoder()

NEW

+46

pw::protobuf::Decoder::Next()

NEW

+46

pw::protobuf::WriteVarint()

NEW

+44

pw::protobuf::StreamDecoder::BytesReader::~BytesReader()

NEW

+44

pw::protobuf::StreamDecoder::CloseBytesReader()

NEW

+44

pw::protobuf::WriteLengthDelimitedKeyAndLengthPrefix()

NEW

+42

pw::protobuf::TagSizeBytes<>()

NEW

+40

pw::protobuf::Decoder::ReadBool()

NEW

+38

pw::protobuf::StreamDecoder::GetLengthDelimitedPayloadBounds()

NEW

+36

pw::protobuf::StreamDecoder::BytesReader

NEW

+32

pw::protobuf::Decoder::ReadString()

NEW

+32

pw::protobuf::ValidFieldNumber()

NEW

+22

pw::protobuf::StreamEncoder::WriteZigzagVarint()

+9,170

To demonstrate these layers, we use the following protobuf message definition in the examples:

message Customer {
  enum Status {
    NEW = 1;
    ACTIVE = 2;
    INACTIVE = 3;
  }
  int32 age = 1;
  string name = 2;
  Status status = 3;
}

And the following accompanying options file:

Customer.name max_size:32

Message Structures#

Pigweed AI summary: This section explains how message structures are created through C++ code generation and integrated with Pigweed's build system. It provides an example of a generated structure and how it can be encoded and decoded. The section also covers message comparison and buffer sizes, including how to handle fields with unknown maximum lengths or variable numbers of repeated submessages. It warns that encoding to an insufficiently large buffer will result in an error and invalid encoded data.

The highest level API is based around message structures created through C++ code generation, integrated with Pigweed’s build system.

This results in the following generated structure:

enum class Customer::Status : uint32_t {
  NEW = 1,
  ACTIVE = 2,
  INACTIVE = 3,

  kNew = NEW,
  kActive = ACTIVE,
  kInactive = INACTIVE,
};

struct Customer::Message {
  int32_t age;
  pw::InlineString<32> name;
  Customer::Status status;
};

Which can be encoded with the code:

#include "example_protos/customer.pwpb.h"

pw::Status EncodeCustomer(Customer::StreamEncoder& encoder) {
  return encoder.Write({
    age = 33,
    name = "Joe Bloggs",
    status = Customer::Status::INACTIVE
  });
}

And decoded into a struct with the code:

#include "example_protos/customer.pwpb.h"

pw::Status DecodeCustomer(Customer::StreamDecoder& decoder) {
  Customer::Message customer{};
  PW_TRY(decoder.Read(customer));
  // Read fields from customer
  return pw::OkStatus();
}

These structures can be moved, copied, and compared with each other for equality.

The encoder and decoder code is generic and implemented in the core C++ module. A small overhead for each message type used in your code describes the structure to the generic encoder and decoders.

Message comparison#

Pigweed AI summary: The article discusses the limitations of using the equality and inequality operators in message structures, which can only work on scalar fields and fixed-size strings. Fields using a callback are not considered in the comparison. To check if the equality operator of a generated message covers all fields, the article suggests using the IsTriviallyComparable function provided by pw_protobuf. The example given shows that while the Point message can be fully compared for equality, the Label message cannot, as it ignores the label string.

Message structures implement operator== and operator!= for equality and inequality comparisons. However, these can only work on trivial scalar fields and fixed-size strings within the message. Fields using a callback are not considered in the comparison.

To check if the equality operator of a generated message covers all fields, pw_protobuf provides an IsTriviallyComparable function.

template <typename Message>
constexpr bool IsTriviallyComparable<Message>();

For example, given the following protobuf definitions:

message Point {
  int32 x = 1;
  int32 y = 2;
}

message Label {
  Point point = 1;
  string label = 2;
}

And the accompanying options file:

Label.label use_callback:true

The Point message can be fully compared for equality, but Label cannot. Label still defines an operator==, but it ignores the label string.

Point::Message one = {.x = 5, .y = 11};
Point::Message two = {.x = 5, .y = 11};

static_assert(pw::protobuf::IsTriviallyComparable<Point::Message>());
ASSERT_EQ(one, two);
static_assert(!pw::protobuf::IsTriviallyComparable<Label::Message>());

Buffer Sizes#

Pigweed AI summary: This section discusses buffer sizes when initializing a MemoryEncoder for encoding protobuf messages. The kMaxEncodedSizeBytes constant represents the maximum encoded size of the message, excluding the contents of any field values which require a callback. The constant includes the maximum length of a field value if specified in the options file, but if not known, the constant includes all relevant overhead and only requires that you add the length of the field values. Insufficiently large buffers will return a Status::ResourceExhausted() error

Initializing a MemoryEncoder requires that you specify the size of the buffer to encode to. The code generation includes a kMaxEncodedSizeBytes constant that represents the maximum encoded size of the protobuf message, excluding the contents of any field values which require a callback.

#include "example_protos/customer.pwpb.h"

std::byte buffer[Customer::kMaxEncodedSizeBytes];
Customer::MemoryEncoder encoder(buffer);
const auto status = encoder.Write({
  age = 22,
  name = "Wolfgang Bjornson",
  status = Customer::Status::ACTIVE
});

// Always check the encoder status or return values from Write calls.
if (!status.ok()) {
  PW_LOG_INFO("Failed to encode proto; %s", encoder.status().str());
}

In the above example, because the name field has a max_size specified in the accompanying options file, kMaxEncodedSizeBytes includes the maximum length of the value for that field.

Where the maximum length of a field value is not known, indicated by the structure requiring a callback for that field, the constant includes all relevant overhead and only requires that you add the length of the field values.

For example if a bytes field length is not specified in the options file, but is known to your code (kMaxImageDataSize in this example being a constant in your own code), you can simply add it to the generated constant:

#include "example_protos/store.pwpb.h"

const std::byte image_data[kMaxImageDataSize] = { ... };

Store::Message store{};
// Calling SetEncoder means we must always extend the buffer size.
store.image_data.SetEncoder([](Store::StreamEncoder& encoder) {
  return encoder.WriteImageData(image_data);
});

std::byte buffer[Store::kMaxEncodedSizeBytes + kMaxImageDataSize];
Store::MemoryEncoder encoder(buffer);
const auto status = encoder.Write(store);

// Always check the encoder status or return values from Write calls.
if (!status.ok()) {
  PW_LOG_INFO("Failed to encode proto; %s", encoder.status().str());
}

Or when using a variable number of repeated submessages, where the maximum number is known to your code but not to the proto, you can add the constants from one message type to another:

#include "example_protos/person.pwpb.h"

Person::Message grandchild{};
// Calling SetEncoder means we must always extend the buffer size.
grandchild.grandparent.SetEncoder([](Person::StreamEncoder& encoder) {
  PW_TRY(encoder.GetGrandparentEncoder().Write(maternal_grandma));
  PW_TRY(encoder.GetGrandparentEncoder().Write(maternal_grandpa));
  PW_TRY(encoder.GetGrandparentEncoder().Write(paternal_grandma));
  PW_TRY(encoder.GetGrandparentEncoder().Write(paternal_grandpa));
  return pw::OkStatus();
});

std::byte buffer[Person::kMaxEncodedSizeBytes +
                 Grandparent::kMaxEncodedSizeBytes * 4];
Person::MemoryEncoder encoder(buffer);
const auto status = encoder.Write(grandchild);

// Always check the encoder status or return values from Write calls.
if (!status.ok()) {
  PW_LOG_INFO("Failed to encode proto; %s", encoder.status().str());
}

Warning

Encoding to a buffer that is insufficiently large will return Status::ResourceExhausted() from Write calls, and from the encoder’s status() call. Always check the status of calls or the encoder, as in the case of error, the encoded data will be invalid.

Per-Field Writers and Readers#

Pigweed AI summary: The middle level API for Pigweed's protobuf encoders uses typed methods to directly write and read each field of a message to the final serialized form. Lightweight wrappers are generated around the core implementation, resulting in no additional binary code. The encoding method can be written using the generated per-field typed writers. Pigweed's protobuf encoders encode directly to the wire format of a proto, making any writes of a value final and unmodifiable in later steps of the encode process.

The middle level API is based around typed methods to write and read each field of the message directly to the final serialized form, again created through C++ code generation.

Encoding#

Pigweed AI summary: This section discusses encoding in Pigweed's protobuf encoders. The StreamEncoder class generates additional methods for encoding a message structure, and there are lightweight wrappers around the core implementation that result in no additional binary code. The encoding method can be written using these methods. Pigweed's protobuf encoders encode directly to the wire format of a proto, and any writes of a value are final and cannot be referenced or modified later in the encode process.

Given the same message structure, in addition to the Write() method that accepts a message structure, the following additional methods are also generated in the typed StreamEncoder class.

There are lightweight wrappers around the core implementation, calling the underlying methods with the correct field numbers and value types, and result in no additional binary code over correctly using the core implementation.

class Customer::StreamEncoder : pw::protobuf::StreamEncoder {
 public:
  // Message Structure Writer.
  pw::Status Write(const Customer::Message&);

  // Per-Field Typed Writers.
  pw::Status WriteAge(int32_t);

  pw::Status WriteName(std::string_view);
  pw::Status WriteName(const char*, size_t);

  pw::Status WriteStatus(Customer::Status);
};

So the same encoding method could be written as:

#include "example_protos/customer.pwpb.h"

Status EncodeCustomer(Customer::StreamEncoder& encoder) {
  PW_TRY(encoder.WriteAge(33));
  PW_TRY(encoder.WriteName("Joe Bloggs"sv));
  PW_TRY(encoder.WriteStatus(Customer::Status::INACTIVE));
}

Pigweed’s protobuf encoders encode directly to the wire format of a proto rather than staging information to a mutable datastructure. This means any writes of a value are final, and can’t be referenced or modified as a later step in the encode process.

Casting between generated StreamEncoder types#

Pigweed AI summary: The pw_protobuf library guarantees that all generated StreamEncoder classes can be converted among each other. It is also safe to convert any MemoryEncoder to any StreamEncoder. This guarantee exists to facilitate the usage of protobuf overlays, which are protobuf message definitions that ensure fields defined in one message will not conflict with fields defined in other messages. The library provides a helper template called StreamEncoderCast<> that simplifies the casting process. However, it is important to note that using this to convert one stream encoder to

pw_protobuf guarantees that all generated StreamEncoder classes can be converted among each other. It’s also safe to convert any MemoryEncoder to any other StreamEncoder.

This guarantee exists to facilitate usage of protobuf overlays. Protobuf overlays are protobuf message definitions that deliberately ensure that fields defined in one message will not conflict with fields defined in other messages.

For example:

// The first half of the overlaid message.
message BaseMessage {
  uint32 length = 1;
  reserved 2;  // Reserved for Overlay
}

// OK: The second half of the overlaid message.
message Overlay {
  reserved 1;  // Reserved for BaseMessage
  uint32 height = 2;
}

// OK: A message that overlays and bundles both types together.
message Both {
  uint32 length = 1;  // Defined independently by BaseMessage
  uint32 height = 2;  // Defined independently by Overlay
}

// BAD: Diverges from BaseMessage's definition, and can cause decode
// errors/corruption.
message InvalidOverlay {
  fixed32 length = 1;
}

The StreamEncoderCast<>() helper template reduces very messy casting into a much easier to read syntax:

#include "pw_protobuf/encoder.h"
#include "pw_protobuf_test_protos/full_test.pwpb.h"

Result<ConstByteSpan> EncodeOverlaid(uint32_t height,
                                     uint32_t length,
                                     ConstByteSpan encode_buffer) {
  BaseMessage::MemoryEncoder base(encode_buffer);

  // Without StreamEncoderCast<>(), this line would be:
  //   Overlay::StreamEncoder& overlay =
  //       *static_cast<Overlay::StreamEncoder*>(
  //           static_cast<pw::protobuf::StreamEncoder*>(&base)
  Overlay::StreamEncoder& overlay =
      StreamEncoderCast<Overlay::StreamEncoder>(base);
  if (!overlay.WriteHeight(height).ok()) {
    return overlay.status();
  }
  if (!base.WriteLength(length).ok()) {
    return base.status();
  }
  return ConstByteSpan(base);
}

While this use case is somewhat uncommon, it’s a core supported use case of pw_protobuf.

Warning

Using this to convert one stream encoder to another when the messages themselves do not safely overlay will result in corrupt protos. Be careful when doing this as there’s no compile-time way to detect whether or not two messages are meant to overlay.

Decoding#

Pigweed AI summary: This section discusses the decoding process in protobuf, which involves using the StreamDecoder class and additional methods to read and parse the message structure. The decoding process is more complex than encoding or using the message structure, as it requires looping through the fields. The section provides sample code for decoding a customer message and warns about the deprecation of the Fields::SNAKE_CASE naming convention, which will soon require explicit setting of a GN variable for transitional support.

For decoding, in addition to the Read() method that populates a message structure, the following additional methods are also generated in the typed StreamDecoder class.

class Customer::StreamDecoder : pw::protobuf::StreamDecoder {
 public:
  // Message Structure Reader.
  pw::Status Read(Customer::Message&);

  // Returns the identity of the current field.
  ::pw::Result<Fields> Field();

  // Per-Field Typed Readers.
  pw::Result<int32_t> ReadAge();

  pw::StatusWithSize ReadName(pw::span<char>);
  BytesReader GetNameReader(); // Read name as a stream of bytes.

  pw::Result<Customer::Status> ReadStatus();
};

Complete and correct decoding requires looping through the fields, so is more complex than encoding or using the message structure.

pw::Status DecodeCustomer(Customer::StreamDecoder& decoder) {
  uint32_t age;
  char name[32];
  Customer::Status status;

  while ((status = decoder.Next()).ok()) {
    switch (decoder.Field().value()) {
      case Customer::Fields::kAge: {
        PW_TRY_ASSIGN(age, decoder.ReadAge());
        break;
      }
      case Customer::Fields::kName: {
        PW_TRY(decoder.ReadName(name));
        break;
      }
      case Customer::Fields::kStatus: {
        PW_TRY_ASSIGN(status, decoder.ReadStatus());
        break;
      }
    }
  }

  return status.IsOutOfRange() ? OkStatus() : status;
}

Warning

Fields::SNAKE_CASE is deprecated. Use Fields::kCamelCase.

Transitional support for Fields::SNAKE_CASE will soon only be available by explicitly setting the following GN variable in your project: pw_protobuf_compiler_GENERATE_LEGACY_ENUM_SNAKE_CASE_NAMES=true

This support will be removed after downstream projects have been migrated.

Reading a single field#

Pigweed AI summary: The paragraph discusses how to read a single field from a serialized message using the pw_protobuf library. It mentions that setting up a decoder and iterating through the message can be cumbersome, so the library provides convenient Find*() functions for most fields in a message. It provides code examples for reading a single field from a serialized message and also mentions that the Find() APIs work with streamed data. The paragraph also includes a note stating that if multiple fields need to be read, it is more efficient to instantiate

Sometimes, only a single field from a serialized message needs to be read. In these cases, setting up a decoder and iterating through the message is a lot of boilerplate. pw_protobuf generates convenient Find*() functions for most fields in a message which handle this for you.

pw::Status ReadCustomerData(pw::ConstByteSpan serialized_customer) {
  pw::Result<uint32_t> age = Customer::FindAge(serialized_customer);
  if (!age.ok()) {
    return age.status();
  }

  // This will scan the buffer again from the start, which is less efficient
  // than writing a custom decoder loop.
  pw::Result<std::string_view> name = Customer::FindName(serialized_customer);
  if (!age.ok()) {
    return age.status();
  }

  DoStuff(age, name);
  return pw::OkStatus();
}

The Find APIs also work with streamed data, as shown below.

pw::Status ReadCustomerData(pw::stream::Reader& customer_stream) {
  pw::Result<uint32_t> age = Customer::FindAge(customer_stream);
  if (!age.ok()) {
    return age.status();
  }

  // This will begin scanning for `name` from the current position of the
  // stream (following the `age` field). If `name` appeared before `age` in
  // the serialized data, it will not be found.
  //
  // Note that unlike with the buffer APIs, stream Find methods copy `string`
  // and `bytes` fields into a user-provided buffer.
  char name[32];
  pw::StatusWithSize sws = Customer::FindName(serialized_customer, name);
  if (!sws.ok()) {
    return sws.status();
  }
  if (sws.size() >= sizeof(name)) {
    return pw::Status::OutOfRange();
  }
  name[sws.size()] = '\0';

  DoStuff(age, name);
  return pw::OkStatus();
}

Note

Each call to Find*() linearly scans through the message. If you have to read multiple fields, it is more efficient to instantiate your own decoder as described above. Additionally, to avoid confusion, Find*() methods are not generated for repeated fields.

Direct Writers and Readers#

The lowest level API is provided by the core C++ implementation, and requires the caller to provide the correct field number and value types for encoding, or check the same when decoding.

Encoding#

Pigweed AI summary: This section discusses encoding in Protobuf, specifically the two fundamental classes: MemoryEncoder and StreamEncoder. StreamEncoder allows for encoding a proto to something like pw::sys_io without needing to build the complete message in memory. To encode a message, field numbers must be specified and can be hardcoded or supplemented using the C++ code generated Fields enum and cast as an enumerated type. The code examples demonstrate how to use the core API to encode a message using StreamEncoder.

The two fundamental classes are MemoryEncoder which directly encodes a proto to an in-memory buffer, and StreamEncoder that operates on pw::stream::Writer objects to serialize proto data.

StreamEncoder allows you encode a proto to something like pw::sys_io without needing to build the complete message in memory

To encode the same message we’ve used in the examples thus far, we would use the following parts of the core API:

class pw::protobuf::StreamEncoder {
 public:
  Status WriteInt32(uint32_t field_number, int32_t);
  Status WriteUint32(uint32_t field_number, uint32_t);

  Status WriteString(uint32_t field_number, std::string_view);
  Status WriteString(uint32_t field_number, const char*, size_t);

  // And many other methods, see pw_protobuf/encoder.h
};

Encoding the same message requires that we specify the field numbers, which we can hardcode, or supplement using the C++ code generated Fields enum, and cast the enumerated type.

#include "pw_protobuf/encoder.h"
#include "example_protos/customer.pwpb.h"

Status EncodeCustomer(pw::protobuf::StreamEncoder& encoder) {
  PW_TRY(encoder.WriteInt32(static_cast<uint32_t>(Customer::Fields::kAge),
                            33));
  PW_TRY(encoder.WriteString(static_cast<uint32_t>(Customer::Fields::kName),
                             "Joe Bloggs"sv));
  PW_TRY(encoder.WriteUint32(
      static_cast<uint32_t>(Customer::Fields::kStatus),
      static_cast<uint32_t>(Customer::Status::INACTIVE)));
}

Decoding#

Pigweed AI summary: The article discusses decoding using the StreamDecoder class in the core API of pw_protobuf. The StreamDecoder reads data from a pw::stream::Reader and has methods for decoding various data types. To decode a message, one must loop through the fields and check the field numbers, along with casting types. The article provides an example of decoding a customer message using the StreamDecoder class.

StreamDecoder reads data from a pw::stream::Reader and mirrors the API of the encoders.

To decode the same message we would use the following parts of the core API:

class pw::protobuf::StreamDecoder {
 public:
  // Returns the identity of the current field.
  ::pw::Result<uint32_t> FieldNumber();

  Result<int32_t> ReadInt32();
  Result<uint32_t> ReadUint32();

  StatusWithSize ReadString(pw::span<char>);

  // And many other methods, see pw_protobuf/stream_decoder.h
};

As with the typed per-field API, complete and correct decoding requires looping through the fields and checking the field numbers, along with casting types.

pw::Status DecodeCustomer(pw::protobuf::StreamDecoder& decoder) {
  uint32_t age;
  char name[32];
  Customer::Status status;

  while ((status = decoder.Next()).ok()) {
    switch (decoder.FieldNumber().value()) {
      case static_cast<uint32_t>(Customer::Fields::kAge): {
        PW_TRY_ASSIGN(age, decoder.ReadInt32());
        break;
      }
      case static_cast<uint32_t>(Customer::Fields::kName): {
        PW_TRY(decoder.ReadString(name));
        break;
      }
      case static_cast<uint32_t>(Customer::Fields::kStatus): {
        uint32_t status_value;
        PW_TRY_ASSIGN(status_value, decoder.ReadUint32());
        status = static_cast<Customer::Status>(status_value);
        break;
      }
    }
  }

  return status.IsOutOfRange() ? OkStatus() : status;
}

Find APIs#

Sometimes, only a single field from a serialized message needs to be read. In these cases, setting up a decoder and iterating through the message is a lot of boilerplate. pw_protobuf provides convenient Find*() functions which handle this for you.

pw::Status PrintCustomerAge(pw::ConstByteSpan serialized_customer) {
  pw::Result<uint32_t> age = pw::protobuf::FindUint32(
      serialized_customer, Customer::Fields::kAge);
  if (!age.ok()) {
    return age.status();
  }

  PW_LOG_INFO("Customer's age is %u", *age);
  return pw::OkStatus();
}

Note

Each call to Find*() linearly scans through the message. If you have to read multiple fields, it is more efficient to instantiate your own decoder as described above.

namespace pw#
namespace protobuf#

Functions

inline Result<uint32_t> FindUint32(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a uint32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint32_t> FindUint32(ConstByteSpan message, T field)#
inline Result<uint32_t> FindUint32(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a uint32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint32_t> FindUint32(stream::Reader &message_stream, T field)#
inline Result<int32_t> FindInt32(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an int32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindInt32(ConstByteSpan message, T field)#
inline Result<int32_t> FindInt32(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an int32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindInt32(stream::Reader &message_stream, T field)#
inline Result<int32_t> FindSint32(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an sint32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindSint32(ConstByteSpan message, T field)#
inline Result<int32_t> FindSint32(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an sint32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindSint32(stream::Reader &message_stream, T field)#
inline Result<uint64_t> FindUint64(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a uint64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint64_t> FindUint64(ConstByteSpan message, T field)#
inline Result<uint64_t> FindUint64(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a uint64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint64_t> FindUint64(stream::Reader &message_stream, T field)#
inline Result<int64_t> FindInt64(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an int64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindInt64(ConstByteSpan message, T field)#
inline Result<int64_t> FindInt64(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an int64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindInt64(stream::Reader &message_stream, T field)#
inline Result<int64_t> FindSint64(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an sint64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindSint64(ConstByteSpan message, T field)#
inline Result<int64_t> FindSint64(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an sint64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindSint64(stream::Reader &message_stream, T field)#
inline Result<bool> FindBool(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a bool field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<bool> FindBool(ConstByteSpan message, T field)#
inline Result<bool> FindBool(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a bool field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<bool> FindBool(stream::Reader &message_stream, T field)#
inline Result<uint32_t> FindFixed32(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a fixed32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint32_t> FindFixed32(ConstByteSpan message, T field)#
inline Result<uint32_t> FindFixed32(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a fixed32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint32_t> FindFixed32(stream::Reader &message_stream, T field)#
inline Result<uint64_t> FindFixed64(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a fixed64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint64_t> FindFixed64(ConstByteSpan message, T field)#
inline Result<uint64_t> FindFixed64(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a fixed64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<uint64_t> FindFixed64(stream::Reader &message_stream, T field)#
inline Result<int32_t> FindSfixed32(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an sfixed32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindSfixed32(ConstByteSpan message, T field)#
inline Result<int32_t> FindSfixed32(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an sfixed32 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int32_t> FindSfixed32(stream::Reader &message_stream, T field)#
inline Result<int64_t> FindSfixed64(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for an sfixed64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindSfixed64(ConstByteSpan message, T field)#
inline Result<int64_t> FindSfixed64(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for an sfixed64 field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<int64_t> FindSfixed64(stream::Reader &message_stream, T field)#
inline Result<float> FindFloat(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a float field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<float> FindFloat(ConstByteSpan message, T field)#
inline Result<float> FindFloat(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a float field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<float> FindFloat(stream::Reader &message_stream, T field)#
inline Result<double> FindDouble(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a double field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<double> FindDouble(ConstByteSpan message, T field)#
inline Result<double> FindDouble(stream::Reader &message_stream, uint32_t field_number)#

Scans a serialized protobuf message for a double field.

return:

The field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<double> FindDouble(stream::Reader &message_stream, T field)#
inline Result<std::string_view> FindString(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a string field.

return:

Subspan of the buffer containing the string field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type. NOTE: The returned string is NOT null-terminated.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<std::string_view> FindString(ConstByteSpan message, T field)#
inline StatusWithSize FindString(stream::Reader &message_stream, uint32_t field_number, span<char> out)#

Scans a serialized protobuf message for a stringfield, copying its data into the provided buffer.

return:

The size of the copied data if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type. NOTE: The returned string is NOT null-terminated.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

  • out – The buffer to which to write the string.

inline StatusWithSize FindString(stream::Reader &message_stream, uint32_t field_number, InlineString<> &out)#

Scans a serialized protobuf message for a stringfield, copying its data into the provided buffer.

return:

The size of the copied data if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

  • out – String to which to write the found value.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline StatusWithSize FindString(stream::Reader &message_stream, T field, span<char> out)#
template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline StatusWithSize FindString(stream::Reader &message_stream, T field, InlineString<> &out)#
inline Result<ConstByteSpan> FindBytes(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a bytes field.

return:

Subspan of the buffer containing the bytes field if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<ConstByteSpan> FindBytes(ConstByteSpan message, T field)#
inline StatusWithSize FindBytes(stream::Reader &message_stream, uint32_t field_number, ByteSpan out)#

Scans a serialized protobuf message for a bytes field, copying its data into the provided buffer.

return:

The size of the copied data if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message_stream – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline StatusWithSize FindBytes(stream::Reader &message_stream, T field, ByteSpan out)#
inline Result<ConstByteSpan> FindSubmessage(ConstByteSpan message, uint32_t field_number)#

Scans a serialized protobuf message for a submessage.

return:

Subspan of the buffer containing the submessage if found, or a status indicating the error otherwise:

  • NOT_FOUND - The field is not present.

  • DATA_LOSS - The serialized message not a valid protobuf.

  • FAILED_PRECONDITION - The field exists, but is not the correct type.

Parameters:
  • message – The serialized message to search.

  • field_number – Protobuf field number of the field.

template<typename T, typename = std::enable_if_t<std::is_enum_v<T>>>
inline Result<ConstByteSpan> FindSubmessage(ConstByteSpan message, T field)#
namespace internal#

Functions

Status AdvanceToField(Decoder &decoder, uint32_t field_number)#
Status AdvanceToField(StreamDecoder &decoder, uint32_t field_number)#
template<typename T, auto ReadFn>
Result<T> Find(ConstByteSpan message, uint32_t field_number)#
template<typename T, auto ReadFn>
Result<T> Find(stream::Reader &reader, uint32_t field_number)#

Handling of packages#

Pigweed AI summary: This section explains how package declarations in .proto files are converted to namespace declarations in pw_protobuf. The package name is appended with "::pwpb" to avoid clashes for projects that link against multiple C++ proto libraries in the same library. In some cases, external message references may cause the codegen to emit the package name as "pw::pwpb_xxx::my::cool::project", where "pwpb_xxx" is the name of an unspecified private namespace. Users

Package declarations in .proto files are converted to namespace declarations. Unlike protoc’s native C++ codegen, pw_protobuf appends an additional ::pwpb namespace after the user-specified package name: for example, package my.cool.project becomes namespace my::cool::project::pwpb. We emit a different package name than stated, in order to avoid clashes for projects that link against multiple C++ proto libraries in the same library.

In some cases, pw_protobuf codegen may encounter external message references during parsing, where it is unable to resolve the package name of the message. In these situations, the codegen is instead forced to emit the package name as pw::pwpb_xxx::my::cool::project, where “pwpb_xxx” is the name of some unspecified private namespace. Users are expected to manually identify the intended namespace name of that symbol, as described above, and must not rely on any such private namespaces, even if they appear in codegen output.

Codegen#

Pigweed AI summary: The pw_protobuf codegen integration is supported in GN, Bazel, and CMake. The codegen is available through the *.pwpb sub-target of a pw_proto_library in GN, CMake, and Bazel. The example BUILD.gn file shows how to control where the *.pwpb.h headers end up on the include path. More information on build system integration for pw_protobuf codegen can be found in the pw_protobuf_compiler’s documentation.

pw_protobuf codegen integration is supported in GN, Bazel, and CMake.

This module’s codegen is available through the *.pwpb sub-target of a pw_proto_library in GN, CMake, and Bazel. See pw_protobuf_compiler’s documentation for more information on build system integration for pw_protobuf codegen.

Example BUILD.gn:

import("//build_overrides/pigweed.gni")

import("$dir_pw_build/target_types.gni")
import("$dir_pw_protobuf_compiler/proto.gni")

# This target controls where the *.pwpb.h headers end up on the include path.
# In this example, it's at "pet_daycare_protos/client.pwpb.h".
pw_proto_library("pet_daycare_protos") {
  sources = [
    "pet_daycare_protos/client.proto",
  ]
}

pw_source_set("example_client") {
  sources = [ "example_client.cc" ]
  deps = [
    ":pet_daycare_protos.pwpb",
    dir_pw_bytes,
    dir_pw_stream,
  ]
}

Configuration#

Pigweed AI summary: The "pw_protobuf" library has configuration options for encoding nested messages, including the number of bytes to reserve for the varint submessage length. The library also supports field options for specifying protocol-level limitations, such as maximum number of entries for repeated fields and maximum size of bytes or string fields. These options can be set in a separate ".options" file, which should be listed as inputs when defining "pw_proto_library". The use of a separate options file is driven by the need for proto

pw_protobuf supports the following configuration options.

  • PW_PROTOBUF_CFG_MAX_VARINT_SIZE: When encoding nested messages, the number of bytes to reserve for the varint submessage length. Nested messages are limited in size to the maximum value that can be varint-encoded into this reserved space.

    The values that can be set, and their corresponding maximum submessage lengths, are outlined below.

    MAX_VARINT_SIZE

    Maximum submessage length

    1 byte

    127

    2 bytes

    16,383 or < 16KiB

    3 bytes

    2,097,151 or < 2048KiB

    4 bytes (default)

    268,435,455 or < 256MiB

    5 bytes

    4,294,967,295 or < 4GiB (max uint32_t)

Field Options#

Pigweed AI summary: The "pw_protobuf" tool supports field options for specifying protocol-level limitations, such as maximum number of entries for repeated fields and maximum size of bytes or string fields. These options influence code generation and can be used for host builds using upstream protoc code generation. The options can be queried and validated using the reflection API. An example code snippet is provided.

pw_protobuf supports the following field options for specifying protocol-level limitations, rather than code generation parameters (although they do influence code generation):

  • max_count: Maximum number of entries for repeated fields.

  • max_size: Maximum size of bytes or string fields.

Even though other proto codegen implementations do not respect these field options, they can still compile protos which use these options. This is especially useful for host builds using upstream protoc code generation, where host software can use the reflection API to query for the options and validate messages comply with the specified limitations.

import "pw_protobuf_protos/field_options.proto";

message Demo {
  string size_limited_string = 1 [(pw.protobuf.pwpb).max_size = 16];
};

Options Files#

Pigweed AI summary: This section explains how to configure code generation using a separate ".options" file alongside the relevant ".proto" file. The format of the options file consists of fully qualified field names or patterns followed by one or more options. The valid options include max_count, fixed_count, max_size, fixed_size, and use_callback. The use of a separate options file is driven by the need for proto files to be shared across multiple contexts, such as different hardware components, server-side components, and app components.

Code generation can be configured using a separate .options file placed alongside the relevant .proto file.

The format of this file is a series of fully qualified field names, or patterns, followed by one or more options. Lines starting with # or // are comments, and blank lines are ignored.

Example:

// Set an option for a specific field.
fuzzy_friends.Client.visit_dates max_count:16

// Set options for multiple fields by wildcard matching.
fuzzy_friends.Pet.* max_size:32

// Set multiple options in one go.
fuzzy_friends.Dog.paws max_count:4 fixed_count:true

Options files should be listed as inputs when defining pw_proto_library, e.g.

pw_proto_library("pet_daycare_protos") {
  sources = [
    "pet_daycare_protos/client.proto",
  ]
  inputs = [
    "pet_daycare_protos/client.options",
  ]
}

Valid options are:

  • max_count: Maximum number of entries for repeated fields. When set, repeated scalar fields will use the pw::Vector container type instead of a callback.

  • fixed_count: Specified with max_count to use a fixed length std::array container instead of pw::Vector.

  • max_size: Maximum size of bytes or string fields. When set, bytes fields use pw::Vector and string fields use pw::InlineString instead of a callback.

  • fixed_size: Specified with max_size to use a fixed length std::array container instead of pw::Vector for bytes fields.

  • use_callback: Replaces the structure member for the field with a callback function even where a simpler type could be used. This can be useful to ignore fields, to stop decoding of complex structures if certain values are not as expected, or to provide special handling for nested messages.

Rationale

The choice of a separate options file, over embedding options within the proto file, are driven by the need for proto files to be shared across multiple contexts.

A typical product would require the same proto be used on a hardware component, running Pigweed; a server-side component, running on a cloud platform; and an app component, running on a Phone OS.

While related, each of these will likely have different source projects and build systems.

Were the Pigweed options embedded in the protos, it would be necessary for both the cloud platform and Phone OS to be able to "import pigweed" — and equivalently for options relevant to their platforms in the embedded software project.

Message Structures#

Pigweed AI summary: The C++ code generator creates a struct Message for each protobuf message that can hold the set of values encoded by it, following certain rules. Scalar fields are represented by their appropriate C++ type, enumerations are represented by a code generated namespaced proto enum, and nested messages are represented by their own struct Message provided that a reference cycle does not exist. Optional scalar fields are represented by the appropriate C++ type wrapped in std::optional. Repeated scalar fields are represented by pw::Vector when the

The C++ code generator creates a struct Message for each protobuf message that can hold the set of values encoded by it, following these rules.

  • Scalar fields are represented by their appropriate C++ type.

    message Customer {
      int32 age = 1;
      uint32 birth_year = 2;
      sint64 rating = 3;
      bool is_active = 4;
    }
    
    struct Customer::Message {
      int32_t age;
      uint32_t birth_year;
      int64_t rating;
      bool is_active;
    };
    
  • Enumerations are represented by a code generated namespaced proto enum.

    message Award {
      enum Service {
        BRONZE = 1;
        SILVER = 2;
        GOLD = 3;
      }
      Service service = 1;
    }
    
    enum class Award::Service : uint32_t {
      BRONZE = 1,
      SILVER = 2,
      GOLD = 3,
    
      kBronze = BRONZE,
      kSilver = SILVER,
      kGold = GOLD,
    };
    
    struct Award::Message {
      Award::Service service;
    };
    

    Aliases to the enum values are also included in the “constant” style to match your preferred coding style. These aliases have any common prefix to the enumeration values removed, such that:

    enum Activity {
      ACTIVITY_CYCLING = 1;
      ACTIVITY_RUNNING = 2;
      ACTIVITY_SWIMMING = 3;
    }
    
    enum class Activity : uint32_t {
      ACTIVITY_CYCLING = 1,
      ACTIVITY_RUNNING = 2,
      ACTIVITY_SWIMMING = 3,
    
      kCycling = ACTIVITY_CYCLING,
      kRunning = ACTIVITY_RUNNING,
      kSwimming = ACTIVITY_SWIMMING,
    };
    
  • Nested messages are represented by their own struct Message provided that a reference cycle does not exist.

    message Sale {
      Customer customer = 1;
      Product product = 2;
    }
    
    struct Sale::Message {
      Customer::Message customer;
      Product::Message product;
    };
    
  • Optional scalar fields are represented by the appropriate C++ type wrapped in std::optional. Optional fields are not encoded when the value is not present.

    message Loyalty {
      optional int32 points = 1;
    }
    
    struct Loyalty::Message {
      std::optional<int32_t> points;
    };
    
  • Repeated scalar fields are represented by pw::Vector when the max_count option is set for that field, or by std::array when both max_count and fixed_count:true are set.

    message Register {
      repeated int32 cash_in = 1;
      repeated int32 cash_out = 2;
    }
    
    Register.cash_in max_count:32 fixed_count:true
    Register.cash_out max_count:64
    
    struct Register::Message {
      std::array<int32_t, 32> cash_in;
      pw::Vector<int32_t, 64> cash_out;
    };
    
  • bytes fields are represented by pw::Vector when the max_size option is set for that field, or by std::array when both max_size and fixed_size:true are set.

    message Product {
      bytes sku = 1;
      bytes serial_number = 2;
    }
    
    Product.sku max_size:8 fixed_size:true
    Product.serial_number max_size:64
    
    struct Product::Message {
      std::array<std::byte, 8> sku;
      pw::Vector<std::byte, 64> serial_number;
    };
    
  • string fields are represented by a pw::InlineString when the max_size option is set for that field. The string can hold up to max_size characters, and is always null terminated. The null terminator is not counted in max_size.

    message Employee {
      string name = 1;
    }
    
    Employee.name max_size:128
    
    struct Employee::Message {
      pw::InlineString<128> name;
    };
    
  • Nested messages with a dependency cycle, repeated scalar fields without a max_count option set, bytes and strings fields without a max_size option set, and repeated nested messages, repeated bytes, and repeated strings fields, are represented by a callback.

    You set the callback to a custom function for encoding or decoding before passing the structure to Write() or Read() appropriately.

    message Store {
      Store nearest_store = 1;
      repeated int32 employee_numbers = 2;
      string driections = 3;
      repeated string address = 4;
      repeated Employee employees = 5;
    }
    
    // No options set.
    
    struct Store::Message {
      pw::protobuf::Callback<Store::StreamEncoder, Store::StreamDecoder> nearest_store;
      pw::protobuf::Callback<Store::StreamEncoder, Store::StreamDecoder> employee_numbers;
      pw::protobuf::Callback<Store::StreamEncoder, Store::StreamDecoder> directions;
      pw::protobuf::Callback<Store::StreamEncoder, Store::StreamDecoder> address;
      pw::protobuf::Callback<Store::StreamEncoder, Store::StreamDecoder> employees;
    };
    

    A Callback object can be converted to a bool indicating whether a callback is set.

Message structures can be copied, but doing so will clear any assigned callbacks. To preserve functions applied to callbacks, ensure that the message structure is moved.

Message structures can also be compared with each other for equality. This includes all repeated and nested fields represented by value types, but does not compare any field represented by a callback.

Reserved-Word Conflicts#

Pigweed AI summary: The article discusses how reserved words and macros in C++ can cause naming conflicts with generated symbols, which can lead to compilation failures. To avoid this, symbols are suffixed with underscores, as shown in examples such as Channel.operator and POSIX-signal names. The names Message and Fields are also suffixed with underscores to prevent conflicts with codegen internals. However, the article warns that certain naming patterns, such as identifiers with consecutive underscores or starting with an underscore followed by a capital letter, are reserved for

Generated symbols whose names conflict with reserved C++ keywords or standard-library macros are suffixed with underscores to avoid compilation failures. This can be seen below in Channel.operator, which is mapped to Channel::Message::operator_ to avoid conflicting with the operator keyword.

message Channel {
  int32 bitrate = 1;
  float signal_to_noise_ratio = 2;
  Company operator = 3;
}
struct Channel::Message {
  int32_t bitrate;
  float signal_to_noise_ratio;
  Company::Message operator_;
};

Similarly, as shown in the example below, some POSIX-signal names conflict with macros defined by the standard-library header <csignal> and therefore require underscore suffixes in the generated code. Note, however, that some signal names are left alone. This is because <csignal> only defines a subset of the POSIX signals as macros; the rest are perfectly valid identifiers that won’t cause any problems unless the user defines custom macros for them. Any naming conflicts caused by user-defined macros are the user’s responsibility (https://google.github.io/styleguide/cppguide.html#Preprocessor_Macros).

enum PosixSignal {
  NONE = 0;
  SIGHUP = 1;
  SIGINT = 2;
  SIGQUIT = 3;
  SIGILL = 4;
  SIGTRAP = 5;
  SIGABRT = 6;
  SIGFPE = 8;
  SIGKILL = 9;
  SIGSEGV = 11;
  SIGPIPE = 13;
  SIGALRM = 14;
  SIGTERM = 15;
}
enum class PosixSignal : uint32_t {
  NONE = 0,
  SIGHUP = 1,
  SIGINT_ = 2,
  SIGQUIT = 3,
  SIGILL_ = 4,
  SIGTRAP = 5,
  SIGABRT_ = 6,
  SIGFPE_ = 8,
  SIGKILL = 9,
  SIGSEGV_ = 11,
  SIGPIPE = 13,
  SIGALRM = 14,
  SIGTERM_ = 15,

  kNone = NONE,
  kSighup = SIGHUP,
  kSigint = SIGINT_,
  kSigquit = SIGQUIT,
  kSigill = SIGILL_,
  kSigtrap = SIGTRAP,
  kSigabrt = SIGABRT_,
  kSigfpe = SIGFPE_,
  kSigkill = SIGKILL,
  kSigsegv = SIGSEGV_,
  kSigpipe = SIGPIPE,
  kSigalrm = SIGALRM,
  kSigterm = SIGTERM_,
};

Much like reserved words and macros, the names Message and Fields are suffixed with underscores in generated C++ code. This is to prevent name conflicts with the codegen internals if they’re used in a nested context as in the example below.

message Function {
  message Message {
    string content = 1;
  }

  enum Fields {
    NONE = 0;
    COMPLEX_NUMBERS = 1;
    INTEGERS_MOD_5 = 2;
    MEROMORPHIC_FUNCTIONS_ON_COMPLEX_PLANE = 3;
    OTHER = 4;
  }

  Message description = 1;
  Fields domain = 2;
  Fields codomain = 3;
}
Function.Message.content max_size:128
struct Function::Message_::Message {
  pw::InlineString<128> content;
};

enum class Function::Message_::Fields : uint32_t {
  CONTENT = 1,
};

enum class Function::Fields_ uint32_t {
  NONE = 0,
  COMPLEX_NUMBERS = 1,
  INTEGERS_MOD_5 = 2,
  MEROMORPHIC_FUNCTIONS_ON_COMPLEX_PLANE = 3,
  OTHER = 4,

  kNone = NONE,
  kComplexNumbers = COMPLEX_NUMBERS,
  kIntegersMod5 = INTEGERS_MOD_5,
  kMeromorphicFunctionsOnComplexPlane =
      MEROMORPHIC_FUNCTIONS_ON_COMPLEX_PLANE,
  kOther = OTHER,
};

struct Function::Message {
  Function::Message_::Message description;
  Function::Fields_ domain;
  Function::Fields_ codomain;
};

enum class Function::Fields : uint32_t {
  DESCRIPTION = 1,
  DOMAIN = 2,
  CODOMAIN = 3,
};

Warning

Note that the C++ spec also reserves two categories of identifiers for the compiler to use in ways that may conflict with generated code:

  • Any identifier that contains two consecutive underscores anywhere in it.

  • Any identifier that starts with an underscore followed by a capital letter.

Appending underscores to symbols in these categories wouldn’t change the fact that they match patterns reserved for the compiler, so the codegen does not currently attempt to fix them. Such names will therefore result in non-portable code that may or may not work depending on the compiler. These naming patterns are of course strongly discouraged in any protobufs that will be used with pw_protobuf codegen.

Overhead#

Pigweed AI summary: This section discusses the use of a single encoder and decoder for structures, with a one-time code cost. The code generator creates a description of the structure that the shared encoder and decoder use. The cost of this description is a shared table for each protobuf message definition used, with four words per field within the protobuf message, and an additional word to store the size of the table.

A single encoder and decoder is used for these structures, with a one-time code cost. When the code generator creates the struct Message, it also creates a description of this structure that the shared encoder and decoder use.

The cost of this description is a shared table for each protobuf message definition used, with four words per field within the protobuf message, and an addition word to store the size of the table.

Encoding#

The simplest way to use MemoryEncoder to encode a proto is from its code generated Message structure into an in-memory buffer.

#include "my_protos/my_proto.pwpb.h"
#include "pw_bytes/span.h"
#include "pw_protobuf/encoder.h"
#include "pw_status/status_with_size.h"

// Writes a proto response to the provided buffer, returning the encode
// status and number of bytes written.
pw::StatusWithSize WriteProtoResponse(pw::ByteSpan response) {
  MyProto::Message message{}
  message.magic_number = 0x1a1a2b2b;
  message.favorite_food = "cookies";
  message.calories = 600;

  // All proto writes are directly written to the `response` buffer.
  MyProto::MemoryEncoder encoder(response);
  encoder.Write(message);

  return pw::StatusWithSize(encoder.status(), encoder.size());
}

All fields of a message are written, including those initialized to their default values.

Alternatively, for example if only a subset of fields are required to be encoded, fields can be written a field at a time through the code generated or lower-level APIs. This can be more convenient if finer grained control or other custom handling is required.

#include "my_protos/my_proto.pwpb.h"
#include "pw_bytes/span.h"
#include "pw_protobuf/encoder.h"
#include "pw_status/status_with_size.h"

// Writes a proto response to the provided buffer, returning the encode
// status and number of bytes written.
pw::StatusWithSize WriteProtoResponse(pw::ByteSpan response) {
  // All proto writes are directly written to the `response` buffer.
  MyProto::MemoryEncoder encoder(response);
  encoder.WriteMagicNumber(0x1a1a2b2b);
  encoder.WriteFavoriteFood("cookies");
  // Only conditionally write calories.
  if (on_diet) {
    encoder.WriteCalories(600);
  }
  return pw::StatusWithSize(encoder.status(), encoder.size());
}

StreamEncoder#

Pigweed AI summary: The StreamEncoder is a tool used to encode data into a destination stream, and it requires a scratch buffer to handle nested submessages. This code example shows how to use the StreamEncoder to write a timestamp and a welcome message to a SysIoWriter, with no intermediate buffering. If there is an error during encoding, it will be logged.

StreamEncoder is constructed with the destination stream, and a scratch buffer used to handle nested submessages.

#include "my_protos/my_proto.pwpb.h"
#include "pw_bytes/span.h"
#include "pw_protobuf/encoder.h"
#include "pw_stream/sys_io_stream.h"

pw::stream::SysIoWriter sys_io_writer;
MyProto::StreamEncoder encoder(sys_io_writer, pw::ByteSpan());

// Once this line returns, the field has been written to the Writer.
encoder.WriteTimestamp(system::GetUnixEpoch());

// There's no intermediate buffering when writing a string directly to a
// StreamEncoder.
encoder.WriteWelcomeMessage("Welcome to Pigweed!");

if (!encoder.status().ok()) {
  PW_LOG_INFO("Failed to encode proto; %s", encoder.status().str());
}

Callbacks#

Pigweed AI summary: The use of the Write() method with a struct Message may require a callback function to be set for certain fields to properly encode their values. The callback is called with the cursor at the field and passed a reference to the encoder. Callback implementations can use any level of API, such as calling Write() on a nested encoder. An example code snippet is provided for setting a callback function.

When using the Write() method with a struct Message, certain fields may require a callback function be set to encode the values for those fields. Otherwise the values will be treated as an empty repeated field and not encoded.

The callback is called with the cursor at the field in question, and passed a reference to the typed encoder that can write the required values to the stream or buffer.

Callback implementations may use any level of API. For example a callback for a nested submessage (with a dependency cycle, or repeated) can be implemented by calling Write() on a nested encoder.

Store::Message store{};
store.employees.SetEncoder([](Store::StreamEncoder& encoder) {
  Employee::Message employee{};
  // Populate `employee`.
  return encoder.GetEmployeesEncoder().Write(employee);
));

Nested submessages#

Code generated GetFieldEncoder methods are provided that return a correctly typed StreamEncoder or MemoryEncoder for the message.

message Owner {
  Animal pet = 1;
}

Note that the accessor method is named for the field, while the returned encoder is named for the message type.

Animal::StreamEncoder Owner::StreamEncoder::GetPetEncoder()#

A lower-level API method returns an untyped encoder, which only provides the lower-level API methods. This can be cast to a typed encoder if needed.

pw::protobuf::StreamEncoder pw::protobuf::StreamEncoder::GetNestedEncoder(uint32_t field_number, EmptyEncoderBehavior empty_encoder_behavior = EmptyEncoderBehavior::kWriteFieldNumber)#

(The optional empty_encoder_behavior parameter allows the user to disable writing the tag number for the nested encoder, if no data was written to that nested decoder.)

Warning

When a nested submessage is created, any use of the parent encoder that created the nested encoder will trigger a crash. To resume using the parent encoder, destroy the submessage encoder first.

Buffering#

Pigweed AI summary: This article discusses the limitations of the proto format when writing proto messages with nested submessages, which requires buffering. A streaming encoder can be passed a scratch buffer to use when constructing nested messages, and all submessage data is buffered to this scratch buffer until the submessage is finalized. The article also provides guidance on calculating the appropriate size of the scratch buffer and the destination buffer, and warns that if the scratch buffer size is not sufficient, the encoding will fail with Status::ResourceExhausted().

Writing proto messages with nested submessages requires buffering due to limitations of the proto format. Every proto submessage must know the size of the submessage before its final serialization can begin. A streaming encoder can be passed a scratch buffer to use when constructing nested messages. All submessage data is buffered to this scratch buffer until the submessage is finalized. Note that the contents of this scratch buffer is not necessarily valid proto data, so don’t try to use it directly.

The code generation includes a kScratchBufferSizeBytes constant that represents the size of the largest submessage and all necessary overhead, excluding the contents of any field values which require a callback.

If a submessage field requires a callback, due to a dependency cycle, or a repeated field of unknown length, the size of the submessage cannot be included in the kScratchBufferSizeBytes constant. If you encode a submessage of this type (which you’ll know you’re doing because you set an encoder callback for it) simply add the appropriate structure’s kMaxEncodedSizeBytes constant to the scratch buffer size to guarantee enough space.

When calculating yourself, the MaxScratchBufferSize() helper function can also be useful in estimating how much space to allocate to account for nested submessage encoding overhead.

#include "my_protos/pets.pwpb.h"
#include "pw_bytes/span.h"
#include "pw_protobuf/encoder.h"
#include "pw_stream/sys_io_stream.h"

pw::stream::SysIoWriter sys_io_writer;
// The scratch buffer should be at least as big as the largest nested
// submessage. It's a good idea to be a little generous.
std::byte submessage_scratch_buffer[Owner::kScratchBufferSizeBytes];

// Provide the scratch buffer to the proto encoder. The buffer's lifetime must
// match the lifetime of the encoder.
Owner::StreamEncoder owner_encoder(sys_io_writer, submessage_scratch_buffer);

{
  // Note that the parent encoder, owner_encoder, cannot be used until the
  // nested encoder, pet_encoder, has been destroyed.
  Animal::StreamEncoder pet_encoder = owner_encoder.GetPetEncoder();

  // There's intermediate buffering when writing to a nested encoder.
  pet_encoder.WriteName("Spot");
  pet_encoder.WriteType(Pet::Type::DOG);

  // When this scope ends, the nested encoder is serialized to the Writer.
  // In addition, the parent encoder, owner_encoder, can be used again.
}

// If an encode error occurs when encoding the nested messages, it will be
// reflected at the root encoder.
if (!owner_encoder.status().ok()) {
  PW_LOG_INFO("Failed to encode proto; %s", owner_encoder.status().str());
}

MemoryEncoder objects use the final destination buffer rather than relying on a scratch buffer. The kMaxEncodedSizeBytes constant takes into account the overhead required for nesting submessages. If you calculate the buffer size yourself, your destination buffer might need additional space.

Warning

If the scratch buffer size is not sufficient, the encoding will fail with Status::ResourceExhausted(). Always check the results of Write calls or the encoder status to ensure success, as otherwise the encoded data will be invalid.

Scalar Fields#

As shown, scalar fields are written using code generated WriteFoo methods that accept the appropriate type and automatically writes the correct field number.

Status MyProto::StreamEncoder::WriteFoo(T)#

These can be freely intermixed with the lower-level API that provides a method per field type, requiring that the field number be passed in. The code generation includes a Fields enum to provide the field number values.

Status pw::protobuf::StreamEncoder::WriteUint64(uint32_t field_number, uint64_t)#
Status pw::protobuf::StreamEncoder::WriteSint64(uint32_t field_number, int64_t)#
Status pw::protobuf::StreamEncoder::WriteInt64(uint32_t field_number, int64_t)#
Status pw::protobuf::StreamEncoder::WriteUint32(uint32_t field_number, uint32_t)#
Status pw::protobuf::StreamEncoder::WriteSint32(uint32_t field_number, int32_t)#
Status pw::protobuf::StreamEncoder::WriteInt32(uint32_t field_number, int32_t)#
Status pw::protobuf::StreamEncoder::WriteFixed64(uint32_t field_number, uint64_t)#
Status pw::protobuf::StreamEncoder::WriteFixed32(uint32_t field_number, uint64_t)#
Status pw::protobuf::StreamEncoder::WriteDouble(uint32_t field_number, double)#
Status pw::protobuf::StreamEncoder::WriteFloat(uint32_t field_number, float)#
Status pw::protobuf::StreamEncoder::WriteBool(uint32_t field_number, bool)#

The following two method calls are equivalent, where the first is using the code generated API, and the second implemented by hand.

my_proto_encoder.WriteAge(42);
my_proto_encoder.WriteInt32(static_cast<uint32_t>(MyProto::Fields::kAge), 42);

Repeated Fields#

For repeated scalar fields, multiple code generated WriteFoos methods are provided.

Status MyProto::StreamEncoder::WriteFoos(T)#

This writes a single unpacked value.

Status MyProto::StreamEncoder::WriteFoos(pw::span<const T>)#
Status MyProto::StreamEncoder::WriteFoos(const pw::Vector<T>&)#

These write a packed field containing all of the values in the provided span or vector.

These too can be freely intermixed with the lower-level API methods, both to write a single value, or to write packed values from either a pw::span or pw::Vector source.

Status pw::protobuf::StreamEncoder::WritePackedUint64(uint32_t field_number, pw::span<const uint64_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedUint64(uint32_t field_number, const pw::Vector<uint64_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedSint64(uint32_t field_number, pw::span<const int64_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedSint64(uint32_t field_number, const pw::Vector<int64_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedInt64(uint32_t field_number, pw::span<const int64_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedInt64(uint32_t field_number, const pw::Vector<int64_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedUint32(uint32_t field_number, pw::span<const uint32_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedUint32(uint32_t field_number, const pw::Vector<uint32_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedSint32(uint32_t field_number, pw::span<const int32_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedSint32(uint32_t field_number, const pw::Vector<int32_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedInt32(uint32_t field_number, pw::span<const int32_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedInt32(uint32_t field_number, const pw::Vector<int32_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedFixed64(uint32_t field_number, pw::span<const uint64_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedFixed64(uint32_t field_number, const pw::Vector<uint64_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedFixed32(uint32_t field_number, pw::span<const uint64_t>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedFixed32(uint32_t field_number, const pw::Vector<uint64_t>&)#
Status pw::protobuf::StreamEncoder::WritePackedDouble(uint32_t field_number, pw::span<const double>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedDouble(uint32_t field_number, const pw::Vector<double>&)#
Status pw::protobuf::StreamEncoder::WritePackedFloat(uint32_t field_number, pw::span<const float>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedFloat(uint32_t field_number, const pw::Vector<float>&)#
Status pw::protobuf::StreamEncoder::WritePackedBool(uint32_t field_number, pw::span<const bool>)#
Status pw::protobuf::StreamEncoder::WriteRepeatedBool(uint32_t field_number, const pw::Vector<bool>&)#

The following two method calls are equivalent, where the first is using the code generated API, and the second implemented by hand.

constexpr std::array<int32_t, 5> numbers = { 4, 8, 15, 16, 23, 42 };

my_proto_encoder.WriteNumbers(numbers);
my_proto_encoder.WritePackedInt32(
    static_cast<uint32_t>(MyProto::Fields::kNumbers),
    numbers);

Enumerations#

Enumerations are written using code generated WriteEnum methods that accept the code generated enumeration as the appropriate type and automatically writes both the correct field number and corresponding value.

Status MyProto::StreamEncoder::WriteEnum(MyProto::Enum)#

To write enumerations with the lower-level API, you would need to cast both the field number and value to the uint32_t type.

The following two methods are equivalent, where the first is code generated, and the second implemented by hand.

my_proto_encoder.WriteAward(MyProto::Award::SILVER);
my_proto_encoder.WriteUint32(
    static_cast<uint32_t>(MyProto::Fields::kAward),
    static_cast<uint32_t>(MyProto::Award::SILVER));

Repeated Fields#

For repeated enum fields, multiple code generated WriteEnums methods are provided.

Status MyProto::StreamEncoder::WriteEnums(MyProto::Enums)#

This writes a single unpacked value.

Status MyProto::StreamEncoder::WriteEnums(pw::span<const MyProto::Enums>)#
Status MyProto::StreamEncoder::WriteEnums(const pw::Vector<MyProto::Enums>&)#

These write a packed field containing all of the values in the provided span or vector.

Their use is as scalar fields.

Strings#

Strings fields have multiple code generated methods provided.

Status MyProto::StreamEncoder::WriteName(std::string_view)#
Status MyProto::StreamEncoder::WriteName(const char*, size_t)#

These can be freely intermixed with the lower-level API methods.

Status pw::protobuf::StreamEncoder::WriteString(uint32_t field_number, std::string_view)#
Status pw::protobuf::StreamEncoder::WriteString(uint32_t field_number, const char*, size_t)#

A lower level API method is provided that can write a string from another stream.

Status pw::protobuf::StreamEncoder::WriteStringFromStream(uint32_t field_number, stream::Reader &bytes_reader, size_t num_bytes, ByteSpan stream_pipe_buffer)#

The payload for the value is provided through the stream::Reader bytes_reader. The method reads a chunk of the data from the reader using the stream_pipe_buffer and writes it to the encoder.

Bytes#

Bytes fields provide the WriteData code generated method.

Status MyProto::StreamEncoder::WriteData(ConstByteSpan)#

This can be freely intermixed with the lower-level API method.

Status pw::protobuf::StreamEncoder::WriteBytes(uint32_t field_number, ConstByteSpan)#

And with the API method that can write bytes from another stream.

Status pw::protobuf::StreamEncoder::WriteBytesFromStream(uint32_t field_number, stream::Reader &bytes_reader, size_t num_bytes, ByteSpan stream_pipe_buffer)#

The payload for the value is provided through the stream::Reader bytes_reader. The method reads a chunk of the data from the reader using the stream_pipe_buffer and writes it to the encoder.

Error Handling#

Pigweed AI summary: The proto encoder returns pw::Status objects for individual write calls, but it latches onto the first error encountered and this status can be accessed through StreamEncoder::status().

While individual write calls on a proto encoder return pw::Status objects, the encoder tracks all status returns and “latches” onto the first error encountered. This status can be accessed via StreamEncoder::status().

Proto map encoding utils#

Pigweed AI summary: This section provides additional helpers for encoding complex protobuf submessages, such as map<string, bytes>, which can be found in pw_protobuf/map_utils.h. However, it should be noted that the helper API is still in development and may not remain stable.

Some additional helpers for encoding more complex but common protobuf submessages (e.g. map<string, bytes>) are provided in pw_protobuf/map_utils.h.

Note

The helper API are currently in-development and may not remain stable.

Decoding#

The simplest way to use StreamDecoder is to decode a proto from the stream into its code generated Message structure.

#include "my_protos/my_proto.pwpb.h"
#include "pw_protobuf/stream_decoder.h"
#include "pw_status/status.h"
#include "pw_stream/stream.h"

pw::Status DecodeProtoFromStream(pw::stream::Reader& reader) {
  MyProto::Message message{};
  MyProto::StreamDecoder decoder(reader);
  decoder.Read(message);
  return decoder.status();
}

In the case of errors, the decoding will stop and return with the cursor on the field that caused the error. It is valid in some cases to inspect the error and continue decoding by calling Read() again on the same structure, or fall back to using the lower-level APIs.

Unknown fields in the wire encoding are skipped.

If finer-grained control is required, the StreamDecoder class provides an iterator-style API for processing a message a field at a time where calling Next() advances the decoder to the next proto field.

Status pw::protobuf::StreamDecoder::Next()#

In the code generated classes the Field() method returns the current field as a typed Fields enumeration member, while the lower-level API provides a FieldNumber() method that returns the number of the field.

Result<MyProto::Fields> MyProto::StreamDecoder::Field()#
Result<uint32_t> pw::protobuf::StreamDecoder::FieldNumber()#
#include "my_protos/my_proto.pwpb.h"
#include "pw_protobuf/strema_decoder.h"
#include "pw_status/status.h"
#include "pw_status/try.h"
#include "pw_stream/stream.h"

pw::Status DecodeProtoFromStream(pw::stream::Reader& reader) {
  MyProto::StreamDecoder decoder(reader);
  pw::Status status;

  uint32_t age;
  char name[16];

  // Iterate over the fields in the message. A return value of OK indicates
  // that a valid field has been found and can be read. When the decoder
  // reaches the end of the message, Next() will return OUT_OF_RANGE.
  // Other return values indicate an error trying to decode the message.
  while ((status = decoder.Next()).ok()) {
    // Field() returns a Result<Fields> as it may fail sometimes.
    // However, Field() is guaranteed to be valid after a call to Next()
    // that returns OK, so the value can be used directly here.
    switch (decoder.Field().value()) {
      case MyProto::Fields::kAge: {
        PW_TRY_ASSIGN(age, decoder.ReadAge());
        break;
      }
      case MyProto::Fields::kName:
        // The string field is copied into the provided buffer. If the buffer
        // is too small to fit the string, RESOURCE_EXHAUSTED is returned and
        // the decoder is not advanced, allowing the field to be re-read.
        PW_TRY(decoder.ReadName(name));
        break;
    }
  }

  // Do something with the fields...

  return status.IsOutOfRange() ? OkStatus() : status;
}

Callbacks#

Pigweed AI summary: The use of the Read() method with a struct Message may require a callback function to be set for certain fields, or else a DataLoss error will occur. The callback function is called with the cursor at the field in question and is passed a reference to the typed decoder that can examine and decode the field. Callback implementations can use any level of API, such as calling Read() on a nested decoder. An example code block is provided to demonstrate the use of a callback function with a StreamDecoder.

When using the Read() method with a struct Message, certain fields may require a callback function be set, otherwise a DataLoss error will be returned should that field be encountered in the wire encoding.

The callback is called with the cursor at the field in question, and passed a reference to the typed decoder that can examine the field and be used to decode it.

Callback implementations may use any level of API. For example a callback for a nested submessage (with a dependency cycle, or repeated) can be implemented by calling Read() on a nested decoder.

Store::Message store{};
store.employees.SetDecoder([](Store::StreamDecoder& decoder) {
  PW_ASSERT(decoder.Field().value() == Store::Fields::kEmployees);

  Employee::Message employee{};
  // Set any callbacks on `employee`.
  PW_TRY(decoder.GetEmployeesDecoder().Read(employee));
  // Do things with `employee`.
  return OkStatus();
));

Nested submessages#

Code generated GetFieldDecoder methods are provided that return a correctly typed StreamDecoder for the message.

message Owner {
  Animal pet = 1;
}

As with encoding, note that the accessor method is named for the field, while the returned decoder is named for the message type.

Animal::StreamDecoder Owner::StreamDecoder::GetPetDecoder()#

A lower-level API method returns an untyped decoder, which only provides the lower-level API methods. This can be moved to a typed decoder later.

pw::protobuf::StreamDecoder pw::protobuf::StreamDecoder::GetNestedDecoder()#

Warning

When a nested submessage is being decoded, any use of the parent decoder that created the nested decoder will trigger a crash. To resume using the parent decoder, destroy the submessage decoder first.

case Owner::Fields::kPet: {
  // Note that the parent decoder, owner_decoder, cannot be used until the
  // nested decoder, pet_decoder, has been destroyed.
  Animal::StreamDecoder pet_decoder = owner_decoder.GetPetDecoder();

  while ((status = pet_decoder.Next()).ok()) {
    switch (pet_decoder.Field().value()) {
      // Decode pet fields...
    }
  }

  // When this scope ends, the nested decoder is destroyed and the
  // parent decoder, owner_decoder, can be used again.
  break;
}

Scalar Fields#

Scalar fields are read using code generated ReadFoo methods that return the appropriate type and assert that the correct field number ie being read.

Result<T> MyProto::StreamDecoder::ReadFoo()#

These can be freely intermixed with the lower-level API that provides a method per field type, requiring that the caller first check the field number.

Result<uint64_t> pw::protobuf::StreamDecoder::ReadUint64()#
Result<int64_t> pw::protobuf::StreamDecoder::ReadSint64()#
Result<int64_t> pw::protobuf::StreamDecoder::ReadInt64()#
Result<uint32_t> pw::protobuf::StreamDecoder::ReadUint32()#
Result<int32_t> pw::protobuf::StreamDecoder::ReadSint32()#
Result<int32_t> pw::protobuf::StreamDecoder::ReadInt32()#
Result<uint64_t> pw::protobuf::StreamDecoder::ReadFixed64()#
Result<uint64_t> pw::protobuf::StreamDecoder::ReadFixed32()#
Result<double> pw::protobuf::StreamDecoder::ReadDouble()#
Result<float> pw::protobuf::StreamDecoder::ReadFloat()#
Result<bool> pw::protobuf::StreamDecoder::ReadBool()#

The following two code snippets are equivalent, where the first uses the code generated API, and the second implemented by hand.

pw::Result<int32_t> age = my_proto_decoder.ReadAge();
PW_ASSERT(my_proto_decoder.FieldNumber().value() ==
    static_cast<uint32_t>(MyProto::Fields::kAge));
pw::Result<int32_t> my_proto_decoder.ReadInt32();

Repeated Fields#

For repeated scalar fields, multiple code generated ReadFoos methods are provided.

Result<T> MyProto::StreamDecoder::ReadFoos()#

This reads a single unpacked value.

StatusWithSize MyProto::StreamDecoder::ReadFoos(pw::span<T>)#

This reads a packed field containing all of the values into the provided span.

Status MyProto::StreamDecoder::ReadFoos(pw::Vector<T>&)#

Protobuf encoders are permitted to choose either repeating single unpacked values, or a packed field, including splitting repeated fields up into multiple packed fields.

This method supports either format, appending values to the provided pw::Vector.

These too can be freely intermixed with the lower-level API methods, to read a single value, a field of packed values into a pw::span, or support both formats appending to a pw::Vector source.

StatusWithSize pw::protobuf::StreamDecoder::ReadPackedUint64(pw::span<uint64_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedUint64(pw::Vector<uint64_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedSint64(pw::span<int64_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedSint64(pw::Vector<int64_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedInt64(pw::span<int64_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedInt64(pw::Vector<int64_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedUint32(pw::span<uint32_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedUint32(pw::Vector<uint32_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedSint32(pw::span<int32_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedSint32(pw::Vector<int32_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedInt32(pw::span<int32_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedInt32(pw::Vector<int32_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedFixed64(pw::span<uint64_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedFixed64(pw::Vector<uint64_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedFixed32(pw::span<uint64_t>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedFixed32(pw::Vector<uint64_t>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedDouble(pw::span<double>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedDouble(pw::Vector<double>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedFloat(pw::span<float>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedFloat(pw::Vector<float>&)#
StatusWithSize pw::protobuf::StreamDecoder::ReadPackedBool(pw::span<bool>)#
Status pw::protobuf::StreamDecoder::ReadRepeatedBool(pw::Vector<bool>&)#

The following two code blocks are equivalent, where the first uses the code generated API, and the second is implemented by hand.

pw::Vector<int32_t, 8> numbers;

my_proto_decoder.ReadNumbers(numbers);
pw::Vector<int32_t, 8> numbers;

PW_ASSERT(my_proto_decoder.FieldNumber().value() ==
    static_cast<uint32_t>(MyProto::Fields::kNumbers));
my_proto_decoder.ReadRepeatedInt32(numbers);

Enumerations#

pw_protobuf generates a few functions for working with enumerations. Most importantly, enumerations are read using generated ReadEnum methods that return the enumeration as the appropriate generated type.

Result<MyProto::Enum> MyProto::StreamDecoder::ReadEnum()#

Decodes an enum from the stream.

constexpr bool MyProto::IsValidEnum(MyProto::Enum value)#

Validates the value encoded in the wire format against the known set of enumerates.

constexpr const char *MyProto::EnumToString(MyProto::Enum value)#

Returns the string representation of the enum value. For example, FooToString(Foo::kBarBaz) returns "BAR_BAZ". Returns the empty string if the value is not a valid value.

To read enumerations with the lower-level API, you would need to cast the retured value from the uint32_t.

The following two code blocks are equivalent, where the first is using the code generated API, and the second implemented by hand.

pw::Result<MyProto::Award> award = my_proto_decoder.ReadAward();
if (!MyProto::IsValidAward(award)) {
  PW_LOG_DBG("Unknown award");
}
PW_ASSERT(my_proto_decoder.FieldNumber().value() ==
    static_cast<uint32_t>(MyProto::Fields::kAward));
pw::Result<uint32_t> award_value = my_proto_decoder.ReadUint32();
if (award_value.ok()) {
  MyProto::Award award = static_cast<MyProto::Award>(award_value);
}

Repeated Fields#

For repeated enum fields, multiple code generated ReadEnums methods are provided.

Result<MyProto::Enums> MyProto::StreamDecoder::ReadEnums()#

This reads a single unpacked value.

StatusWithSize MyProto::StreamDecoder::ReadEnums(pw::span<MyProto::Enums>)#

This reads a packed field containing all of the checked values into the provided span.

Status MyProto::StreamDecoder::ReadEnums(pw::Vector<MyProto::Enums>&)#

This method supports either repeated unpacked or packed formats, appending checked values to the provided pw::Vector.

Their use is as scalar fields.

Strings#

Strings fields provide a code generated method to read the string into the provided span. Since the span is updated with the size of the string, the string is not automatically null-terminated. pw_string provides utility methods to copy string data from spans into other targets.

StatusWithSize MyProto::StreamDecoder::ReadName(pw::span<char>)#

An additional code generated method is provided to return a nested BytesReader to access the data as a stream. As with nested submessage decoders, any use of the parent decoder that created the bytes reader will trigger a crash. To resume using the parent decoder, destroy the bytes reader first.

pw::protobuf::StreamDecoder::BytesReader MyProto::StreamDecoder::GetNameReader()#

These can be freely intermixed with the lower-level API method:

StatusWithSize pw::protobuf::StreamDecoder::ReadString(pw::span<char>)#

The lower-level GetBytesReader() method can also be used to read string data as bytes.

Bytes#

Bytes fields provide the WriteData code generated method to read the bytes into the provided span.

StatusWithSize MyProto::StreamDecoder::ReadData(ByteSpan)#

An additional code generated method is provided to return a nested BytesReader to access the data as a stream. As with nested submessage decoders, any use of the parent decoder that created the bytes reader will trigger a crash. To resume using the parent decoder, destroy the bytes reader first.

pw::protobuf::StreamDecoder::BytesReader MyProto::StreamDecoder::GetDataReader()#

These can be freely intermixed with the lower-level API methods.

StatusWithSize pw::protobuf::StreamDecoder::ReadBytes(ByteSpan)#
pw::protobuf::StreamDecoder::BytesReader pw::protobuf::StreamDecoder::GetBytesReader()#

The BytesReader supports seeking only if the StreamDecoder’s reader supports seeking.

Error Handling#

Pigweed AI summary: The article explains how the proto decoder handles errors by tracking all status returns and latching onto the first error encountered. The status can be accessed through StreamDecoder::status().

While individual read calls on a proto decoder return pw::Result, pw::StatusWithSize, or pw::Status objects, the decoder tracks all status returns and “latches” onto the first error encountered. This status can be accessed via StreamDecoder::status().

Length Limited Decoding#

Pigweed AI summary: This section discusses length limited decoding in protobuf messages. By specifying the known length of the message to the decoder, it can be prevented from reading beyond the known bounds. The code example provided shows how to construct a decoder with a specified message length. When the decoder goes out of scope, it will consume any remaining bytes in the message length, allowing the next Read() on the stream to be data after the protobuf, even if it was not fully parsed.

Where the length of the protobuf message is known in advance, the decoder can be prevented from reading from the stream beyond the known bounds by specifying the known length to the decoder:

pw::protobuf::StreamDecoder decoder(reader, message_length);

When a decoder constructed in this way goes out of scope, it will consume any remaining bytes in message_length allowing the next Read() on the stream to be data after the protobuf, even when it was not fully parsed.

In-memory Decoder#

Pigweed AI summary: The in-memory decoder class is more efficient than the stream decoder when the complete protobuf data can be stored in memory. However, it requires manual decoding as no code generation is provided. It provides an iterator-style API for processing a message and returns a view of the field within the buffer when reading bytes and string fields. The provided code shows an example of how to use the in-memory decoder class to decode a protobuf message from a buffer.

The separate Decoder class operates on an protobuf message located in a buffer in memory. It is more efficient than the StreamDecoder in cases where the complete protobuf data can be stored in memory. The tradeoff of this efficiency is that no code generation is provided, so all decoding must be performed by hand.

As StreamDecoder, it provides an iterator-style API for processing a message. Calling Next() advances the decoder to the next proto field, which can then be read by calling the appropriate Read* function for the field number.

When reading bytes and string fields, the decoder returns a view of that field within the buffer; no data is copied out.

#include "pw_protobuf/decoder.h"
#include "pw_status/try.h"

pw::Status DecodeProtoFromBuffer(pw::span<const std::byte> buffer) {
  pw::protobuf::Decoder decoder(buffer);
  pw::Status status;

  uint32_t uint32_field;
  std::string_view string_field;

  // Iterate over the fields in the message. A return value of OK indicates
  // that a valid field has been found and can be read. When the decoder
  // reaches the end of the message, Next() will return OUT_OF_RANGE.
  // Other return values indicate an error trying to decode the message.
  while ((status = decoder.Next()).ok()) {
    switch (decoder.FieldNumber()) {
      case 1:
        PW_TRY(decoder.ReadUint32(&uint32_field));
        break;
      case 2:
        // The passed-in string_view will point to the contents of the string
        // field within the buffer.
        PW_TRY(decoder.ReadString(&string_field));
        break;
    }
  }

  // Do something with the fields...

  return status.IsOutOfRange() ? OkStatus() : status;
}

Message Decoder#

Pigweed AI summary: The Message Decoder module provides a class for parsing and accessing fields in a proto message, with interfaces for different field types. It requires a SeekableReader for message access and can be used to process different fields in a proto message. The module also includes a for-range style for iterating and processing single fields directly, which is recommended for performance. The helper API is currently in-development and may not remain stable.

Note

pw::protobuf::Message is unrelated to the codegen struct Message used with StreamDecoder.

The module implements a message parsing helper class Message, in pw_protobuf/message.h, to faciliate proto message parsing and field access. The class provides interfaces for searching fields in a proto message and creating helper classes for it according to its interpreted field type, i.e. uint32, bytes, string, map<>, repeated etc. The class works on top of StreamDecoder and thus requires a pw::stream::SeekableReader for proto message access. The following gives examples for using the class to process different fields in a proto message:

// Consider the proto messages defined as follows:
//
// message Nested {
//   string nested_str = 1;
//   bytes nested_bytes = 2;
// }
//
// message {
//   uint32 integer = 1;
//   string str = 2;
//   bytes bytes = 3;
//   Nested nested = 4;
//   repeated string rep_str = 5;
//   repeated Nested rep_nested  = 6;
//   map<string, bytes> str_to_bytes = 7;
//   map<string, Nested> str_to_nested = 8;
// }

// Given a seekable `reader` that reads the top-level proto message, and
// a <proto_size> that gives the size of the proto message:
Message message(reader, proto_size);

// Parse a proto integer field
Uint32 integer = messasge_parser.AsUint32(1);
if (!integer.ok()) {
  // handle parsing error. i.e. return integer.status().
}
uint32_t integer_value = integer.value(); // obtained the value

// Parse a string field
String str = message.AsString(2);
if (!str.ok()) {
  // handle parsing error. i.e. return str.status();
}

// check string equal
Result<bool> str_check = str.Equal("foo");

// Parse a bytes field
Bytes bytes = message.AsBytes(3);
if (!bytes.ok()) {
  // handle parsing error. i.e. return bytes.status();
}

// Get a reader to the bytes.
stream::IntervalReader bytes_reader = bytes.GetBytesReader();

// Parse nested message `Nested nested = 4;`
Message nested = message.AsMessage(4).
// Get the fields in the nested message.
String nested_str = nested.AsString(1);
Bytes nested_bytes = nested.AsBytes(2);

// Parse repeated field `repeated string rep_str = 5;`
RepeatedStrings rep_str = message.AsRepeatedString(5);
// Iterate through the entries. If proto is malformed when
// iterating, the next element (`str` in this case) will be invalid
// and loop will end in the iteration after.
for (String element : rep_str) {
  // Check status
  if (!str.ok()) {
    // In the case of error, loop will end in the next iteration if
    // continues. This is the chance for code to catch the error.
  }
  // Process str
}

// Parse repeated field `repeated Nested rep_nested = 6;`
RepeatedStrings rep_str = message.AsRepeatedString(6);
// Iterate through the entries. For iteration
for (Message element : rep_rep_nestedstr) {
  // Check status
  if (!element.ok()) {
    // In the case of error, loop will end in the next iteration if
    // continues. This is the chance for code to catch the error.
  }
  // Process element
}

// Parse map field `map<string, bytes> str_to_bytes = 7;`
StringToBytesMap str_to_bytes = message.AsStringToBytesMap(7);
// Access the entry by a given key value
Bytes bytes_for_key = str_to_bytes["key"];
// Or iterate through map entries
for (StringToBytesMapEntry entry : str_to_bytes) {
  // Check status
  if (!entry.ok()) {
    // In the case of error, loop will end in the next iteration if
    // continues. This is the chance for code to catch the error.
  }
  String key = entry.Key();
  Bytes value = entry.Value();
  // process entry
}

// Parse map field `map<string, Nested> str_to_nested = 8;`
StringToMessageMap str_to_nested = message.AsStringToBytesMap(8);
// Access the entry by a given key value
Message nested_for_key = str_to_nested["key"];
// Or iterate through map entries
for (StringToMessageMapEntry entry : str_to_nested) {
  // Check status
  if (!entry.ok()) {
    // In the case of error, loop will end in the next iteration if
    // continues. This is the chance for code to catch the error.
    // However it is still recommended that the user breaks here.
    break;
  }
  String key = entry.Key();
  Message value = entry.Value();
  // process entry
}

The methods in Message for parsing a single field, i.e. everty AsXXX() method except AsRepeatedXXX() and AsStringMapXXX(), internally performs a linear scan of the entire proto message to find the field with the given field number. This can be expensive if performed multiple times, especially on slow reader. The same applies to the operator[] of StringToXXXXMap helper class. Therefore, for performance consideration, whenever possible, it is recommended to use the following for-range style to iterate and process single fields directly.

for (Message::Field field : message) {
  // Check status
  if (!field.ok()) {
    // In the case of error, loop will end in the next iteration if
    // continues. This is the chance for code to catch the error.
  }
  if (field.field_number() == 1) {
    Uint32 integer = field.As<Uint32>();
    ...
  } else if (field.field_number() == 2) {
    String str = field.As<String>();
    ...
  } else if (field.field_number() == 3) {
    Bytes bytes = field.As<Bytes>();
    ...
  } else if (field.field_number() == 4) {
    Message nested = field.As<Message>();
    ...
  }
}

Note

The helper API are currently in-development and may not remain stable.

Size report#

Pigweed AI summary: This report discusses the size of using the entire decoder with all of its decode methods and a decode callback for a proto message containing each of the protobuf field types. It also includes an incremental size report that adds some int32 fields to the decode callback to demonstrate the incremental cost of decoding fields in a message. However, the report warns that size reports will not be generated due to an empty build variable for the target and provides instructions on how to set up size reports.

Full size report#

Pigweed AI summary: This report showcases the size of using the entire decoder with all of its decode methods and a decode callback for a proto message containing each of the protobuf field types. However, the pw_size_report_toolchains build variable is empty for this target, which means that size reports will not be generated. The report suggests referring to the documentation on how to set up size reports.

This report demonstrates the size of using the entire decoder with all of its decode methods and a decode callback for a proto message containing each of the protobuf field types.

Warning

The pw_size_report_toolchains build variable is empty for this target. Size reports will not be generated.

See Defining size reports in GN for details on how to set up size reports.

Incremental size report#

Pigweed AI summary: This paragraph describes the "Incremental size report" which is generated by adding int32 fields to the decode callback to show the cost of decoding fields in a message. However, it also warns that the pw_size_report_toolchains build variable is empty for this target, so size reports will not be generated. It provides a reference for setting up size reports in GN.

This report is generated using the full report as a base and adding some int32 fields to the decode callback to demonstrate the incremental cost of decoding fields in a message.

Warning

The pw_size_report_toolchains build variable is empty for this target. Size reports will not be generated.

See Defining size reports in GN for details on how to set up size reports.

Serialized size calculation#

Pigweed AI summary: The <literal>pw_protobuf/serialized_size.h</literal> library provides functions for calculating the memory required for serialized protocol buffer fields. The <literal>kMaxSizeBytes*</literal> variables give the maximum encoded sizes for each field type, while the <literal>SizeOfField*()</literal> functions calculate the encoded size of a field of a specified type. The <literal>SizeOf*Field</literal> functions calculate the encoded size of fields with a particular wire format. In rare

pw_protobuf/serialized_size.h provides a set of functions for calculating how much memory serialized protocol buffer fields require. The kMaxSizeBytes* variables provide the maximum encoded sizes of each field type. The SizeOfField*() functions calculate the encoded size of a field of the specified type, given a particular key and, for variable length fields (varint or delimited), a value. The SizeOf*Field functions calculate the encoded size of fields with a particular wire format (delimited, varint).

In the rare event that you need to know the serialized size of a field’s tag (field number and wire type), you can use TagSizeBytes() to calculate the tag size for a given field number.

Available protobuf modules#

Pigweed AI summary: This paragraph describes the available protobuf modules for Pigweed projects, which are located in pw_protobuf/pw_protobuf_protos. It also highlights two specific modules: common.proto, which contains an Empty message proto used in many RPC calls, and status.proto, which contains the enum for pw::Status. The note section advises against using pw::protobuf::StatusCode values outside of a .proto file and provides code examples for converting StatusCodes to the Status type in C++.

There are a handful of messages ready to be used in Pigweed projects. These are located in pw_protobuf/pw_protobuf_protos.

common.proto#

Pigweed AI summary: The "common.proto" file contains an Empty message protocol that is frequently used in various RPC calls.

Contains Empty message proto used in many RPC calls.

status.proto#

Pigweed AI summary: This paragraph discusses the contents of the "status.proto" file, which includes an enum for pw::Status. It also provides a note that advises against using pw::protobuf::StatusCode values outside of a .proto file, and instead converting them to the Status type in the language being used. The example given is for C++, where the conversion can be done using static_cast.

Contains the enum for pw::Status.

Note

pw::protobuf::StatusCode values should not be used outside of a .proto file. Instead, the StatusCodes should be converted to the Status type in the language. In C++, this would be:

// Reading from a proto
pw::Status status = static_cast<pw::Status::Code>(proto.status_field));
// Writing to a proto
proto.status_field = static_cast<pw::protobuf::StatusCode>(status.code()));

Comparison with other protobuf libraries#

Pigweed AI summary: This section discusses two different protobuf libraries for use in embedded systems. The first, protobuf-lite, is the official C++ implementation of protobuf but is still relatively large in size and requires dynamic memory allocation. The second, nanopb, is a commonly used embedded protobuf library with very small code size and full code generation, but can run into RAM usage issues when processing nontrivial protobuf messages. To avoid this issue, it is possible to use nanopb’s low-level encode/decode functions, but this

protobuf-lite#

Pigweed AI summary: Protobuf-lite is a smaller version of the protobuf library for C++, designed to reduce code size by using a limited set of features. However, it is still relatively large at around 150K and requires dynamic memory allocation, making it unsuitable for many embedded systems.

protobuf-lite is the official reduced-size C++ implementation of protobuf. It uses a restricted subset of the protobuf library’s features to minimize code size. However, is is still around 150K in size and requires dynamic memory allocation, making it unsuitable for many embedded systems.

nanopb#

Pigweed AI summary: nanopb is a widely used embedded protobuf library that offers small code size and full code generation, providing encoding/decoding functionality and in-memory C structs representing protobuf messages. However, using its generated code can lead to RAM usage issues when processing nontrivial protobuf messages due to the need to define a struct capable of storing all configurations of the message, which can grow incredibly large. To avoid this issue, it is possible to use nanopb’s low-level encode/decode functions to process individual message

nanopb is a commonly used embedded protobuf library with very small code size and full code generation. It provides both encoding/decoding functionality and in-memory C structs representing protobuf messages.

nanopb works well for many embedded products; however, using its generated code can run into RAM usage issues when processing nontrivial protobuf messages due to the necessity of defining a struct capable of storing all configurations of the message, which can grow incredibly large. In one project, Pigweed developers encountered an 11K struct statically allocated for a single message—over twice the size of the final encoded output! (This was what prompted the development of pw_protobuf.)

To avoid this issue, it is possible to use nanopb’s low-level encode/decode functions to process individual message fields directly, but this loses all of the useful semantics of code generation. pw_protobuf is designed to optimize for this use case; it allows for efficient operations on the wire format with an intuitive user interface.

Depending on the requirements of a project, either of these libraries could be suitable.