[DTrace-devel] [PATCH v2] Implement variable length integer support

Mon Jun 14 06:16:20 PDT 2021

On Sat, Jun 12, 2021 at 01:30:58AM -0400, Eugene Loh wrote:
> On 6/11/21 2:57 PM, Kris Van Hees wrote:
> 
> > This patch provides an implementation of variable-length integers,
> > both in libdtrace and as precompiled BPF code.  It is based on the
> > PrefixVarint algorithm.  Unsigned integers are encoded as follows:
> >
> >    0x0 - 0x7f		1 byte:  0xxxxxxx
> >    0x80 - 0x407f		2 bytes: 10xxxxxx xxxxxxxx
> >    0x4080 - 0x20407f	3 bytes: 110xxxxx xxxxxxxx xxxxxxxx
> >
> > and so on.  This algorithm ensures that there is exactly one
> > representation for a given unsigned integers.  It also ensures that
> This patch looks basically unchanged from the first version I saw, aside 
> from the addition of the test.  Was the other feedback considered?  
> E.g., I had started by noting that
>          a given unsigned integers
>      ->
>          a given unsigned integer
> but that has not been changed.  (This is a strange sentence anyhow: why 
> do we care whether there is a unique representation?)

The commit messages got lost somewhere in my reworking of patches - will fix
that.  Unique representation (essentially a bijection) matters because it
ensures that we use the minimal number of bytes for any given value.
> 
> And, again, a few words in the commit message about why this patch is 
> being done would be nice.

I try to avoid giving a 'preview' of what code like this is being added for
because it can actually have multiple purposes and I didn't want to get into
the details on how it is going to get used.  That would be outside the scope of
the actual patch.  It is obvious enough when you look at the patches that
follow so I don't think it adds value to the commit message,  Variable length
integers can be useful outside of string support.

> > the number of bytes in the variable length representation can be
> > determined based on the first byte.  This makes decoding simpler.
> >
> > The following functions are provided:
> >
> >    int dt_int2vint(uint64_t val, char *str)
> > 	Store a 64-bit unsigned integer in the provided string and
> > 	return the number of bytes used.
> >
> >    uint64_t dt_vint2int(const char *str)
> > 	Retrieve a 64-bit unsigned integer from a variable length
> > 	integer in the provided string.
> 
> How about s/in the/that is in the/?  Very minor point, but reads more 
> clearly to me.

Hm, I think that just adds words that don't contribute much to clarity since
the sentence already states that the varint is in the string.

> >    int dt_vint_size(uint64_t val)
> > 	Return the number of bytes necessary to represent the given
> > 	64-bit unsigned integer as a variable length integer.
> >
> >    const char *dt_vint_skip(const char *str)
> > 	Skip over the variable length integer at the start of the
> > 	given string and return a pointer to the first byte beyond
> > 	the variable length integer.
> >
> > Signed-off-by: Kris Van Hees <kris.van.hees at oracle.com>
> > ---
> >   bpf/Build                   |   3 +-
> >   bpf/varint.c                | 199 +++++++++++++++++++++++++++++++++++
> >   libdtrace/Build             |   1 +
> >   libdtrace/dt_varint.c       | 203 ++++++++++++++++++++++++++++++++++++
> >   libdtrace/dt_varint.h       |  82 +++++++++++++++
> >   test/internals/tst.varint.c | 130 +++++++++++++++++++++++
> >   6 files changed, 617 insertions(+), 1 deletion(-)
> >   create mode 100644 bpf/varint.c
> >   create mode 100644 libdtrace/dt_varint.c
> >   create mode 100644 libdtrace/dt_varint.h
> >   create mode 100644 test/internals/tst.varint.c
> 
> The bulk of this patch is those two files
>          bpf/varint.c
>          libdtrace/dt_varint.c
> which are virtually identical and account for about 400 lines.  How 
> about combining them into essentially one copy?  Say, by using some 
> #include magic?  Copy-and-paste is an efficient way of generating new 
> code, but costs a lot in later maintenance.

I think that calling the cost 'a lot' is quite an overstatement.  Maybe there
will be value in putting the code in a single file and including it in the
other, but somehow I am not sure which I would pick.  Put it in libdtrace and
have BPF include that?  Or have it in the BPF source tree and have libstrace
include that?  Both components (libdtrace and BPF) are really things that ought
to be distinct, even if they implement the same functionality.  I guess the
code could be in a neutral place and they both include it, but that just seems
ugly to me.

So I prefer to keep it as two separate files for now.  In the future, who knows
what we may change...

> > diff --git a/bpf/varint.c b/bpf/varint.c
> > new file mode 100644
> > index 00000000..2b525cd1
> > --- /dev/null
> > +++ b/bpf/varint.c
> > @@ -0,0 +1,199 @@
> > +// SPDX-License-Identifier: GPL-2.0
> > +/*
> > + * Copyright (c) 2021, Oracle and/or its affiliates. All rights reserved.
> > + */
> > +#include <stdint.h>
> > +#include <dt_varint.h>
> > +
> > +#ifndef noinline
> > +# define noinline	__attribute__((noinline))
> > +#endif
> > +
> > +noinline int dt_int2vint(uint64_t val, char *str)
> > +{
> > +	int	len;
> > +	uint8_t	*buf = (uint8_t *)str;
> > +
> > +	if (val <= VARINT_1_MAX) {
> > +		buf[0] = (uint8_t)val;
> > +		return 1;
> > +	} else if (val <= VARINT_2_MAX) {
> > +		val -= VARINT_2_MIN;
> > +		len = 2;
> > +		buf[0] = VARINT_2_PREFIX | (uint8_t)(val >> VARINT_2_SHIFT);
> > +		goto two;
> > +	} else if (val <= VARINT_3_MAX) {
> > +		val -= VARINT_3_MIN;
> > +		len = 3;
> > +		buf[0] = VARINT_3_PREFIX | (uint8_t)(val >> VARINT_3_SHIFT);
> > +		goto three;
> > +	} else if (val <= VARINT_4_MAX) {
> > +		val -= VARINT_4_MIN;
> > +		len = 4;
> > +		buf[0] = VARINT_4_PREFIX | (uint8_t)(val >> VARINT_4_SHIFT);
> > +		goto four;
> > +	} else if (val <= VARINT_5_MAX) {
> > +		val -= VARINT_5_MIN;
> > +		len = 5;
> > +		buf[0] = VARINT_5_PREFIX | (uint8_t)(val >> VARINT_5_SHIFT);
> > +		goto five;
> > +	} else if (val <= VARINT_6_MAX) {
> > +		val -= VARINT_6_MIN;
> > +		len = 6;
> > +		buf[0] = VARINT_6_PREFIX | (uint8_t)(val >> VARINT_6_SHIFT);
> > +		goto six;
> > +	} else if (val <= VARINT_7_MAX) {
> > +		val -= VARINT_7_MIN;
> > +		len = 7;
> > +		buf[0] = VARINT_7_PREFIX | (uint8_t)(val >> VARINT_7_SHIFT);
> > +		goto seven;
> > +	} else if (val <= VARINT_8_MAX) {
> > +		val -= VARINT_8_MIN;
> > +		len = 8;
> > +		buf[0] = VARINT_8_PREFIX | (uint8_t)(val >> VARINT_8_SHIFT);
> > +		goto eight;
> > +	}
> > +
> > +	val -= VARINT_9_MIN;
> > +	len = 9;
> > +	buf[0] = VARINT_9_PREFIX;
> > +	buf[8] = (uint8_t)((val >> 56) & 0xff);
> 
> Why are the VARINT_$n_SHIFT symbols defined?  Just go ahead and use the 
> numerical values, which are shorter and easily recognizable and more 
> easily understood than these verbose, funny names that are defined in a 
> different file.  It is hard to imagine there is a good case for the 
> symbols when the very functions that use them also (apparently very 
> happily) use the easier-to-understand numerical values as well.  It is 
> at least inconsistent to use both "VARINT_8_SHIFT" and "56" in nearby 
> lines of the same function.

Two different things...  the defines are part of the set of defines that rule
the encoding of the first byte in the varint.  The other shifts are actual
numeric values because they are simply selecting individual bytes in the
vlaue that we are encoding, each to be stored in consecutive bytes in the 
result.

> > +eight:
> > +	buf[7] = (uint8_t)((val >> 48) & 0xff);
> > +seven:
> > +	buf[6] = (uint8_t)((val >> 40) & 0xff);
> > +six:
> > +	buf[5] = (uint8_t)((val >> 32) & 0xff);
> > +five:
> > +	buf[4] = (uint8_t)((val >> 24) & 0xff);
> > +four:
> > +	buf[3] = (uint8_t)((val >> 16) & 0xff);
> > +three:
> > +	buf[2] = (uint8_t)((val >> 8) & 0xff);
> > +two:
> > +	buf[1] = (uint8_t)(val & 0xff);
> > +
> > +	return len;
> > +}
> > +
> > +noinline uint64_t dt_vint2int(const char *str)
> > +{
> > +	const uint8_t	*buf = (const uint8_t *)str;
> > +	uint64_t	hi = buf[0];
> > +	uint64_t	bs;
> > +	uint64_t	len = 0;
> > +
> > +	if (hi < VARINT_1_PLIM)	  /* 0xxxxxxx -> 0x00 - 0x7f */
> > +		return hi;
> > +	if (hi < VARINT_2_PLIM) { /* 10xxxxxx -> 0x0080 - 0x407f */
> > +		hi &= VARINT_HI_MASK(VARINT_2_PLIM);
> > +		hi <<= VARINT_2_SHIFT;
> > +		bs = VARINT_2_MIN;
> > +		goto two;
> > +	}
> > +	if (hi < VARINT_3_PLIM) { /* 110xxxxx -> 0x4080 - 0x20407f */
> > +		hi &= VARINT_HI_MASK(VARINT_3_PLIM);
> > +		hi <<= VARINT_3_SHIFT;
> > +		bs = VARINT_3_MIN;
> > +		goto three;
> > +	}
> > +	if (hi < VARINT_4_PLIM) { /* 1110xxxx -> 0x204080 - 0x1020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_4_PLIM);
> > +		hi <<= VARINT_4_SHIFT;
> > +		bs = VARINT_4_MIN;
> > +		goto four;
> > +	}
> > +	if (hi < VARINT_5_PLIM) { /* 11110xxx -> 0x10204080 - 0x081020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_5_PLIM);
> > +		hi <<= VARINT_5_SHIFT;
> > +		bs = VARINT_5_MIN;
> > +		goto five;
> > +	}
> > +	if (hi < VARINT_6_PLIM) { /* 111110xx -> 0x0810204080 -> 0x4081020407f */
> 
> Comment strings like
>          /* 111110xx -> 0x0810204080 -> 0x4081020407f */
> are confusing clutter in my opinion.  Drop them.  If there is no 
> higher-level description somewhere else, then someone has no chance of 
> figuring these comments out.  In contrast, if there are higher-level 
> descriptions -- say, in dt_varint.h -- these strings will not be 
> needed.  (Incidentally, the second "->" looks wrong.)  (Also, regarding 
> higher-level descriptions, I make a recommendation below.)

The comment are convenient in knowing what value ramges are covered by each
prefix.  It helped write the test case (which is what I used to test my code
when I was implementing it), and it can help tie things together

And yes, the 2nd -> are mistakes.  Fixed.

> > +		hi &= VARINT_HI_MASK(VARINT_6_PLIM);
> > +		hi <<= VARINT_6_SHIFT;
> > +		bs = VARINT_6_MIN;
> > +		goto six;
> > +	}
> > +	if (hi < VARINT_7_PLIM) { /* 1111110x -> 0x40810204080 -> 0x204081020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_7_PLIM);
> > +		hi <<= VARINT_7_SHIFT;
> > +		bs = VARINT_7_MIN;
> > +		goto seven;
> > +	}
> > +	if (hi < VARINT_8_PLIM) { /* 11111110 -> 0x2040810204080 -> 0x10204081020407f */
> > +		hi = 0;
> > +		bs = VARINT_8_MIN;
> > +		goto eight;
> > +	}
> > +
> > +	/* 11111111 -> 0x102040810204080 - 0xffffffffffffffff */
> > +	hi = 0;
> > +	bs = VARINT_9_MIN;
> > +
> > +	len += ((uint64_t)buf[8]) << 56;
> > +eight:
> > +	len += ((uint64_t)buf[7]) << 48;
> > +seven:
> > +	len += ((uint64_t)buf[6]) << 40;
> > +six:
> > +	len += ((uint64_t)buf[5]) << 32;
> > +five:
> > +	len += ((uint64_t)buf[4]) << 24;
> > +four:
> > +	len += ((uint64_t)buf[3]) << 16;
> > +three:
> > +	len += ((uint64_t)buf[2]) << 8;
> > +two:
> > +	len += (uint64_t)buf[1];
> > +	len += hi;
> > +
> > +	return bs + len;
> > +}
> 
> In dt_vint2int(), I cannot figure out bs or len -- neither what those 
> names are supposed to mean nor why they are defined in the first place.  
> How about just dropping those two variables and then simply adding 
> contributions directly to the output value as you get them?  E.g., stuff 
> like

Yes, 'len' is better named 'val' actaually because we may use this code for
other things besides string lengths.

Variable 'bs' is the base value for the range of the prefix.

> -               bs = VARINT_3_MIN;
> +               hi += VARINT_3_MIN;
> and
> -       bs = VARINT_9_MIN;
> +       hi += VARINT_9_MIN;
> and
> -       len += ((uint64_t)buf[8]) << 56;
> +       hi += ((uint64_t)buf[8]) << 56;
> and then simply
> -       len += hi;
> -       return bs + len;
> +       return hi;
> And perhaps rename "hi" to be "val".

No, this will confuse the code more.  In 'hi' we store the highest order bits
of the value, i.e. those bits that are stored in the available bits in the
first byte).  In 'bs' we store the base value of the range.  Adding 'bs' to 'hi'
prior to the shift is wrong.  It would have to be added after the shift.  But
that is confusing because the decoding process is a reconstruction process that
puts the bits in place in 8 bit chunks (byte by byte).
> 
> > +noinline int dt_vint_size(uint64_t val)
> > +{
> > +	if (val <= VARINT_1_MAX)
> > +		return 1;
> > +	if (val <= VARINT_2_MAX)
> > +		return 2;
> > +	if (val <= VARINT_3_MAX)
> > +		return 3;
> > +	if (val <= VARINT_4_MAX)
> > +		return 4;
> > +	if (val <= VARINT_5_MAX)
> > +		return 5;
> > +	if (val <= VARINT_6_MAX)
> > +		return 6;
> > +	if (val <= VARINT_7_MAX)
> > +		return 7;
> > +	if (val <= VARINT_8_MAX)
> > +		return 8;
> > +
> > +	return 9;
> > +}
> > +
> > +noinline const char *dt_vint_skip(const char *str)
> > +{
> > +	const uint8_t	*buf = (const uint8_t *)str;
> > +	uint64_t	hi = buf[0];
> > +
> > +	if (hi < VARINT_1_PLIM)	 /* 0xxxxxxx -> 0x00 - 0x7f */
> > +		return &str[1];
> > +	if (hi < VARINT_2_PLIM)  /* 10xxxxxx -> 0x0080 - 0x407f */
> > +		return &str[2];
> > +	if (hi < VARINT_3_PLIM)  /* 110xxxxx -> 0x4080 - 0x20407f */
> > +		return &str[3];
> > +	if (hi < VARINT_4_PLIM)  /* 1110xxxx -> 0x204080 - 0x1020407f */
> > +		return &str[4];
> > +	if (hi < VARINT_5_PLIM)  /* 11110xxx -> 0x10204080 - 0x081020407f */
> > +		return &str[5];
> > +	if (hi < VARINT_6_PLIM)  /* 111110xx -> 0x0810204080 -> 0x4081020407f */
> > +		return &str[6];
> > +	if (hi < VARINT_7_PLIM)  /* 1111110x -> 0x40810204080 -> 0x204081020407f */
> > +		return &str[7];
> > +	if (hi < VARINT_8_PLIM)  /* 11111110 -> 0x2040810204080 -> 0x10204081020407f */
> > +		return &str[8];
> > +
> > +	return &str[9];
> > +}
> 
> Awk!  Those confusing comment strings again and the double ->.

Same comment as above.

> > diff --git a/libdtrace/Build b/libdtrace/Build
> > index 47e92736..3555c857 100644
> > --- a/libdtrace/Build
> > +++ b/libdtrace/Build
> > @@ -61,6 +61,7 @@ libdtrace-build_SOURCES = dt_aggregate.c \
> >   			  dt_strtab.c \
> >   			  dt_subr.c \
> >   			  dt_symtab.c \
> > +			  dt_varint.c \
> >   			  dt_work.c \
> >   			  dt_xlator.c
> >   
> > diff --git a/libdtrace/dt_varint.c b/libdtrace/dt_varint.c
> > new file mode 100644
> > index 00000000..72801acd
> > --- /dev/null
> > +++ b/libdtrace/dt_varint.c
> > @@ -0,0 +1,203 @@
> > +/*
> > + * Oracle Linux DTrace.
> > + * Copyright (c) 2021, Oracle and/or its affiliates. All rights reserved.
> > + * Licensed under the Universal Permissive License v 1.0 as shown at
> > + * http://oss.oracle.com/licenses/upl.
> > + */
> > +
> > +#include <sys/types.h>
> > +#include <stdint.h>
> > +#include <dt_varint.h>
> > +
> > +int
> > +dt_int2vint(uint64_t val, char *str)
> > +{
> > +	uint8_t	*buf = (uint8_t *)str;
> > +	int	len;
> > +
> > +	if (val <= VARINT_1_MAX) {
> > +		buf[0] = (uint8_t)val;
> > +		return 1;
> > +	} else if (val <= VARINT_2_MAX) {
> > +		val -= VARINT_2_MIN;
> > +		len = 2;
> > +		buf[0] = VARINT_2_PREFIX | (uint8_t)(val >> VARINT_2_SHIFT);
> > +		goto two;
> > +	} else if (val <= VARINT_3_MAX) {
> > +		val -= VARINT_3_MIN;
> > +		len = 3;
> > +		buf[0] = VARINT_3_PREFIX | (uint8_t)(val >> VARINT_3_SHIFT);
> > +		goto three;
> > +	} else if (val <= VARINT_4_MAX) {
> > +		val -= VARINT_4_MIN;
> > +		len = 4;
> > +		buf[0] = VARINT_4_PREFIX | (uint8_t)(val >> VARINT_4_SHIFT);
> > +		goto four;
> > +	} else if (val <= VARINT_5_MAX) {
> > +		val -= VARINT_5_MIN;
> > +		len = 5;
> > +		buf[0] = VARINT_5_PREFIX | (uint8_t)(val >> VARINT_5_SHIFT);
> > +		goto five;
> > +	} else if (val <= VARINT_6_MAX) {
> > +		val -= VARINT_6_MIN;
> > +		len = 6;
> > +		buf[0] = VARINT_6_PREFIX | (uint8_t)(val >> VARINT_6_SHIFT);
> > +		goto six;
> > +	} else if (val <= VARINT_7_MAX) {
> > +		val -= VARINT_7_MIN;
> > +		len = 7;
> > +		buf[0] = VARINT_7_PREFIX | (uint8_t)(val >> VARINT_7_SHIFT);
> > +		goto seven;
> > +	} else if (val <= VARINT_8_MAX) {
> > +		val -= VARINT_8_MIN;
> > +		len = 8;
> > +		buf[0] = VARINT_8_PREFIX | (uint8_t)(val >> VARINT_8_SHIFT);
> > +		goto eight;
> > +	}
> > +
> > +	val -= VARINT_9_MIN;
> > +	len = 9;
> > +	buf[0] = VARINT_9_PREFIX;
> > +	buf[8] = (uint8_t)((val >> 56) & 0xff);
> > +eight:
> > +	buf[7] = (uint8_t)((val >> 48) & 0xff);
> > +seven:
> > +	buf[6] = (uint8_t)((val >> 40) & 0xff);
> > +six:
> > +	buf[5] = (uint8_t)((val >> 32) & 0xff);
> > +five:
> > +	buf[4] = (uint8_t)((val >> 24) & 0xff);
> > +four:
> > +	buf[3] = (uint8_t)((val >> 16) & 0xff);
> > +three:
> > +	buf[2] = (uint8_t)((val >> 8) & 0xff);
> > +two:
> > +	buf[1] = (uint8_t)(val & 0xff);
> > +
> > +	return len;
> > +}
> > +
> > +uint64_t
> > +dt_vint2int(const char *str)
> > +{
> > +	const uint8_t	*buf = (const uint8_t *)str;
> > +	uint64_t	hi = buf[0];
> > +	uint64_t	bs;
> > +	uint64_t	len = 0;
> > +
> > +	if (hi < VARINT_1_PLIM)	  /* 0xxxxxxx -> 0x00 - 0x7f */
> > +		return hi;
> > +	if (hi < VARINT_2_PLIM) { /* 10xxxxxx -> 0x0080 - 0x407f */
> > +		hi &= VARINT_HI_MASK(VARINT_2_PLIM);
> > +		hi <<= VARINT_2_SHIFT;
> > +		bs = VARINT_2_MIN;
> > +		goto two;
> > +	}
> > +	if (hi < VARINT_3_PLIM) { /* 110xxxxx -> 0x4080 - 0x20407f */
> > +		hi &= VARINT_HI_MASK(VARINT_3_PLIM);
> > +		hi <<= VARINT_3_SHIFT;
> > +		bs = VARINT_3_MIN;
> > +		goto three;
> > +	}
> > +	if (hi < VARINT_4_PLIM) { /* 1110xxxx -> 0x204080 - 0x1020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_4_PLIM);
> > +		hi <<= VARINT_4_SHIFT;
> > +		bs = VARINT_4_MIN;
> > +		goto four;
> > +	}
> > +	if (hi < VARINT_5_PLIM) { /* 11110xxx -> 0x10204080 - 0x081020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_5_PLIM);
> > +		hi <<= VARINT_5_SHIFT;
> > +		bs = VARINT_5_MIN;
> > +		goto five;
> > +	}
> > +	if (hi < VARINT_6_PLIM) { /* 111110xx -> 0x0810204080 -> 0x4081020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_6_PLIM);
> > +		hi <<= VARINT_6_SHIFT;
> > +		bs = VARINT_6_MIN;
> > +		goto six;
> > +	}
> > +	if (hi < VARINT_7_PLIM) { /* 1111110x -> 0x40810204080 -> 0x204081020407f */
> > +		hi &= VARINT_HI_MASK(VARINT_7_PLIM);
> > +		hi <<= VARINT_7_SHIFT;
> > +		bs = VARINT_7_MIN;
> > +		goto seven;
> > +	}
> > +	if (hi < VARINT_8_PLIM) { /* 11111110 -> 0x2040810204080 -> 0x10204081020407f */
> > +		hi = 0;
> > +		bs = VARINT_8_MIN;
> > +		goto eight;
> > +	}
> > +
> > +	/* 11111111 -> 0x102040810204080 - 0xffffffffffffffff */
> > +	hi = 0;
> > +	bs = VARINT_9_MIN;
> > +
> > +	len += ((uint64_t)buf[8]) << 56;
> > +eight:
> > +	len += ((uint64_t)buf[7]) << 48;
> > +seven:
> > +	len += ((uint64_t)buf[6]) << 40;
> > +six:
> > +	len += ((uint64_t)buf[5]) << 32;
> > +five:
> > +	len += ((uint64_t)buf[4]) << 24;
> > +four:
> > +	len += ((uint64_t)buf[3]) << 16;
> > +three:
> > +	len += ((uint64_t)buf[2]) << 8;
> > +two:
> > +	len += (uint64_t)buf[1];
> > +	len += hi;
> > +
> > +	return bs + len;
> > +}
> > +
> > +int
> > +dt_vint_size(uint64_t val)
> > +{
> > +	if (val <= VARINT_1_MAX)
> > +		return 1;
> > +	if (val <= VARINT_2_MAX)
> > +		return 2;
> > +	if (val <= VARINT_3_MAX)
> > +		return 3;
> > +	if (val <= VARINT_4_MAX)
> > +		return 4;
> > +	if (val <= VARINT_5_MAX)
> > +		return 5;
> > +	if (val <= VARINT_6_MAX)
> > +		return 6;
> > +	if (val <= VARINT_7_MAX)
> > +		return 7;
> > +	if (val <= VARINT_8_MAX)
> > +		return 8;
> > +
> > +	return 9;
> > +}
> > +
> > +const char *
> > +dt_vint_skip(const char *str)
> > +{
> > +	const uint8_t	*buf = (const uint8_t *)str;
> > +	uint64_t	hi = buf[0];
> > +
> > +	if (hi < VARINT_1_PLIM)	 /* 0xxxxxxx -> 0x00 - 0x7f */
> > +		return &str[1];
> > +	if (hi < VARINT_2_PLIM)  /* 10xxxxxx -> 0x0080 - 0x407f */
> > +		return &str[2];
> > +	if (hi < VARINT_3_PLIM)  /* 110xxxxx -> 0x4080 - 0x20407f */
> > +		return &str[3];
> > +	if (hi < VARINT_4_PLIM)  /* 1110xxxx -> 0x204080 - 0x1020407f */
> > +		return &str[4];
> > +	if (hi < VARINT_5_PLIM)  /* 11110xxx -> 0x10204080 - 0x081020407f */
> > +		return &str[5];
> > +	if (hi < VARINT_6_PLIM)  /* 111110xx -> 0x0810204080 -> 0x4081020407f */
> > +		return &str[6];
> > +	if (hi < VARINT_7_PLIM)  /* 1111110x -> 0x40810204080 -> 0x204081020407f */
> > +		return &str[7];
> > +	if (hi < VARINT_8_PLIM)  /* 11111110 -> 0x2040810204080 -> 0x10204081020407f */
> > +		return &str[8];
> > +
> > +	return &str[9];
> > +}
> > diff --git a/libdtrace/dt_varint.h b/libdtrace/dt_varint.h
> > new file mode 100644
> > index 00000000..fe68153b
> > --- /dev/null
> > +++ b/libdtrace/dt_varint.h
> > @@ -0,0 +1,82 @@
> > +/*
> > + * Oracle Linux DTrace.
> > + * Copyright (c) 2021, Oracle and/or its affiliates. All rights reserved.
> > + * Licensed under the Universal Permissive License v 1.0 as shown at
> > + * http://oss.oracle.com/licenses/upl.
> > + */
> > +
> 
> How about adding something like this to dt_varint.h?

Sure.  This is helpful.

> /*
>   * Variable-length integers
>   *
>   * These functions convert between uint64_t integers and strings of 1-9
>   * bytes.  The first 1<<7 integers are stored in a single byte with leading
>   * bit 0.  The next 1<<14 integers are stored in two bytes with leading 
> bits
>   * 1 and 0.  And so on.  Here are the ranges of integers:
>   *
>   *      minimum integer    # of    leading    # of bits
>   *        in this range    bytes     byte     left over       # of integers
>   *                                (in bits)   for values      in this range
>   *
>   *                    0      1     0???????    1*8-1= 7                0x80
>   *                 0x80      2     10??????    2*8-2=14 0x4000
>   *               0x4080      3     110?????    3*8-3=21 0x200000
>   *             0x204080      4     1110????    4*8-4=28 0x10000000
>   *           0x10204080      5     11110???    5*8-5=35 0x800000000
>   *          0x810204080      6     111110??    6*8-6=42 0x40000000000
>   *        0x40810204080      7     1111110?    7*8-7=49 0x2000000000000
>   *      0x2040810204080      8     11111110    8*8-8=56 0x100000000000000
>   *    0x102040810204080      9     11111111    9*8-8=64        1 << 64
>   *
>   * If n is the number of bytes:
>   *   VARINT_$n_PREFIX  =  leading byte (with 0 for ?), shown above
>   *   VARINT_$n_PLIM    =  VARINT_${n+1}_PREFIX
>   *   VARINT_$n_SHIFT   =  8*(n-1)
>   *   VARINT_$n_MIN     =  VARINT_${n-1}_MAX + 1   with VARINT_1_MIN = 0
>   *   VARINT_$n_MAX     =  inclusive maximum
>   *
>   * Notice that since we go up to at most 9 bytes.  So if the first 8 bits
>   * are 1s, we we know the next 8 bytes represent the value.
>   */
> 
> 
> > +#ifndef	_DT_VARINT_H
> > +#define	_DT_VARINT_H
> > +
> > +#ifdef	__cplusplus
> > +extern "C" {
> > +#endif
> > +
> > +#define VARINT_HI_MASK(b)	((uint8_t)~(b))
> > +
> > +#define VARINT_1_PREFIX		(uint8_t)0x00
> > +#define VARINT_1_PLIM		0x80
> > +#define VARINT_1_SHIFT		0
> > +#define VARINT_1_MIN		0
> > +#define VARINT_1_MAX		0x7f
> > +
> > +#define VARINT_2_PREFIX		(uint8_t)0x80
> > +#define VARINT_2_PLIM		0xc0
> > +#define VARINT_2_SHIFT		8
> > +#define VARINT_2_MIN		(VARINT_1_MAX + 1)
> > +#define VARINT_2_MAX		0x407f
> > +
> > +#define VARINT_3_PREFIX		(uint8_t)0xc0
> > +#define VARINT_3_PLIM		0xe0
> > +#define VARINT_3_SHIFT		16
> > +#define VARINT_3_MIN		(VARINT_2_MAX + 1)
> > +#define VARINT_3_MAX		0x20407f
> > +
> > +#define VARINT_4_PREFIX		(uint8_t)0xe0
> > +#define VARINT_4_PLIM		0xf0
> > +#define VARINT_4_SHIFT		24
> > +#define VARINT_4_MIN		(VARINT_3_MAX + 1)
> > +#define VARINT_4_MAX		0x1020407f
> > +
> > +#define VARINT_5_PREFIX		(uint8_t)0xf0
> > +#define VARINT_5_PLIM		0xf8
> > +#define VARINT_5_SHIFT		32
> > +#define VARINT_5_MIN		(VARINT_4_MAX + 1)
> > +#define VARINT_5_MAX		0x081020407f
> > +
> > +#define VARINT_6_PREFIX		(uint8_t)0xf8
> > +#define VARINT_6_PLIM		0xfc
> > +#define VARINT_6_SHIFT		40
> > +#define VARINT_6_MIN		(VARINT_5_MAX + 1)
> > +#define VARINT_6_MAX		0x04081020407f
> > +
> > +#define VARINT_7_PREFIX		(uint8_t)0xfc
> > +#define VARINT_7_PLIM		0xfe
> > +#define VARINT_7_SHIFT		48
> > +#define VARINT_7_MIN		(VARINT_6_MAX + 1)
> > +#define VARINT_7_MAX		0x0204081020407f
> > +
> > +#define VARINT_8_PREFIX		(uint8_t)0xfe
> > +#define VARINT_8_PLIM		0xff
> > +#define VARINT_8_SHIFT		56
> > +#define VARINT_8_MIN		(VARINT_7_MAX + 1)
> > +#define VARINT_8_MAX		0x010204081020407f
> > +
> > +#define VARINT_9_PREFIX		(uint8_t)0xff
> > +#define VARINT_9_PLIM		0xff
> > +#define VARINT_9_SHIFT		0
> > +#define VARINT_9_MIN		(VARINT_8_MAX + 1)
> > +#define VARINT_9_MAX		0xffffffffffffffff
> 
> Again, dt_varint.h defines a number of symbols that are never used. It 
> would be clearer not to define them.  "Completion" makes no sense since 
> clearly things at the ends are different;  besides, the actual code 
> makes no effort to demonstrate such "symmetry."  And to my taste, 
> eliminating some of these symbols and using their definitions would make 
> the code a lot more readable and easier to verify.  Like replacing the 
> SHIFTs with their numerical values and the n_MAX with {n+1}_MIN - 1.

I (obviously0 disagree.  I see no problem with providing definitions for these
values for all cases of n in [1, 9].  I can change VARINT_n_PREFIX to be
VARINT_(n-1)_PLIM to make that sequence more clear rather than having the values
in there (since it is an actual sequence).  But especially when adding in the
helpful comment block you provided, it makes sense to show the definitions for
all cases.

I already commented above why I prefer to use the symbolic names rather than
straight numeric values in the code.  Again, with adding in the comment block I
think that is even more of an advantage because it relates the values to one
another more clearly.
> 
> > +
> > +#define VARINT_MAX_BYTES	9
> > +
> > +extern int dt_int2vint(uint64_t num, char *str);
> > +extern uint64_t dt_vint2int(const char *str);
> > +extern int dt_vint_size(uint64_t val);
> > +extern const char *dt_vint_skip(const char *str);
> > +
> > +#ifdef	__cplusplus
> > +}
> > +#endif
> > +
> > +#endif	/* _DT_VARINT_H */
> > diff --git a/test/internals/tst.varint.c b/test/internals/tst.varint.c
> > new file mode 100644
> > index 00000000..7011b488
> > --- /dev/null
> > +++ b/test/internals/tst.varint.c
> > @@ -0,0 +1,130 @@
> > +/* @@link: -Ilibdtrace */
> > +
> > +#include <stdint.h>
> > +#include <stdio.h>
> > +#include <stdlib.h>
> > +#include <dt_varint.h>
> > +
> > +void
> > +check_length(uint64_t val, int len, int exp)
> > +{
> > +	if (len != exp) {
> > +		printf("Length wrong for %lu: %d vs %d\n", val, len, exp);
> > +		exit(1);
> > +	}
> > +}
> > +
> > +void
> > +check_size(uint64_t val, int len, int exp)
> > +{
> > +	if (len != exp) {
> > +		printf("Size wrong for %lu: %d vs %d\n", val, len, exp);
> > +		exit(1);
> > +	}
> > +}
> 
> Should "len" be "siz"?  Looks like some copy-and-paste side effect.

Why does it matter?  Whether it is a length or a size, it is still the same
value.  And in fact, when you look at the caller, we *are* comparing against
an expected prefix length.  So, we compare lemngth against expected length.

> > +
> > +void
> > +check_value(uint64_t dval, uint64_t eval, int len)
> > +{
> > +	if (dval != eval) {
> > +		printf("Value decode error (%d byte prefix): %lx vs %lx\n",
> > +		       len, dval, eval);
> > +		exit(1);
> > +	}
> > +}
> > +
> > +void
> > +check_range(uint64_t lo, uint64_t hi, int len)
> > +{
> > +	uint64_t	val;
> > +	char		s[VARINT_MAX_BYTES];
> > +	int		rc;
> > +
> > +	for (val = lo - 10000; val < lo + 10000; val++) {
> > +		rc = dt_int2vint(val, s);
> > +		if (val < lo) {
> > +			check_length(val, rc, len - 1);
> > +			check_size(val, dt_vint_size(val), len - 1);
> > +		} else {
> > +			check_length(val, rc, len);
> > +			check_size(val, dt_vint_size(val), len);
> > +		}
> > +
> > +		check_value(dt_vint2int(s), val, rc);
> > +	}
> 
> Actually, that loop body is repeated quite a few times in this test.  
> Might as well put it into a function and just call that function instead 
> of always replicating the code.  I'll send an example in a patch, but it 
> is not meant to be a separate patch. It's intended to be squashed into 
> this one.

Comments on this are in email reply to that patch.

> > +
> > +	for (val = hi - 10000; val < hi + 10000; val++) {
> > +		rc = dt_int2vint(val, s);
> > +		if (val <= hi) {
> > +			check_length(val, rc, len);
> > +			check_size(val, dt_vint_size(val), len);
> > +		} else {
> > +			check_length(val, rc, len + 1);
> > +			check_size(val, dt_vint_size(val), len + 1);
> > +		}
> > +
> > +		check_value(dt_vint2int(s), val, rc);
> > +	}
> > +}
> > +
> > +int
> > +main(void)
> > +{
> > +	uint64_t	val;
> > +	char		s[VARINT_MAX_BYTES];
> > +	int		rc;
> > +
> > +	/* First range: we go through all 16-bit values. */
> > +	for (val = 0; val < 0xffff; val++) {
> > +		rc = dt_int2vint(val, s);
> > +		if (val <= VARINT_1_MAX) {
> > +			check_length(val, rc, 1);
> > +			check_size(val, dt_vint_size(val), 1);
> > +		} else if (val <= VARINT_2_MAX) {
> > +			check_length(val, rc, 2);
> > +			check_size(val, dt_vint_size(val), 2);
> > +		} else {
> > +			check_length(val, rc, 3);
> > +			check_size(val, dt_vint_size(val), 3);
> > +		}
> > +
> > +		check_value(dt_vint2int(s), val, rc);
> > +	}
> > +
> > +	/* For higher ranges, verify the low and high boundary ranges. */
> > +	check_range(VARINT_3_MIN, VARINT_3_MAX, 3);
> > +	check_range(VARINT_4_MIN, VARINT_4_MAX, 4);
> > +	check_range(VARINT_5_MIN, VARINT_5_MAX, 5);
> > +	check_range(VARINT_6_MIN, VARINT_6_MAX, 6);
> > +	check_range(VARINT_7_MIN, VARINT_7_MAX, 7);
> > +	check_range(VARINT_8_MIN, VARINT_8_MAX, 8);
> > +
> > +	/* Verify the final range. */
> > +	for (val = VARINT_9_MIN - 10000; val < VARINT_9_MIN + 10000; val++) {
> > +		rc = dt_int2vint(val, s);
> > +		if (val < VARINT_9_MIN) {
> > +			check_length(val, rc, 8);
> > +			check_size(val, dt_vint_size(val), 8);
> > +		} else {
> > +			check_length(val, rc, 9);
> > +			check_size(val, dt_vint_size(val), 9);
> > +		}
> > +
> > +		check_value(dt_vint2int(s), val, rc);
> > +	}
> > +
> > +	for (val = VARINT_9_MAX - 10000; val < VARINT_9_MAX; val++) {
> > +		rc = dt_int2vint(val, s);
> > +		check_length(val, rc, 9);
> > +		check_size(val, dt_vint_size(val), 9);
> > +
> > +		check_value(dt_vint2int(s), val, rc);
> > +	}
> > +
> > +	rc = dt_int2vint(VARINT_9_MAX, s);
> > +	check_length(VARINT_9_MAX, rc, 9);
> > +	check_size(VARINT_9_MAX, dt_vint_size(VARINT_9_MAX), 9);
> > +	check_value(dt_vint2int(s), VARINT_9_MAX, rc);
> > +}
> > +
> > +#include "dt_varint.c"
> 
> _______________________________________________
> DTrace-devel mailing list
> DTrace-devel at oss.oracle.com
> https://oss.oracle.com/mailman/listinfo/dtrace-devel