public class FrontCodedRabaCoder extends Object implements IRabaCoder, Externalizable
null
values are not allowed.Modifier and Type | Class and Description |
---|---|
static class |
FrontCodedRabaCoder.DefaultFrontCodedRabaCoder
A pre-parameterized version of the
FrontCodedRabaCoder which is
used as the default IRabaCoder for B+Tree keys for both nodes and
leaves. |
Modifier and Type | Field and Description |
---|---|
protected static org.apache.log4j.Logger |
log |
Constructor and Description |
---|
FrontCodedRabaCoder()
De-serialization ctor.
|
FrontCodedRabaCoder(int ratio) |
Modifier and Type | Method and Description |
---|---|
ICodedRaba |
decode(AbstractFixedByteArrayBuffer data)
Return an
IRaba which can access the coded data. |
AbstractFixedByteArrayBuffer |
encode(IRaba raba,
DataOutputBuffer buf)
Encode the data.
|
ICodedRaba |
encodeLive(IRaba raba,
DataOutputBuffer buf)
Encode the data, returning an
ICodedRaba . |
boolean |
isDuplicateKeys()
Return true iff this
IRabaCoder supports duplicate keys. |
boolean |
isKeyCoder()
Return
true if this implementation can code B+Tree keys
(supports search on the coded representation). |
boolean |
isValueCoder()
Return
true if this implementation can code B+Tree values
(allows null s). |
void |
readExternal(ObjectInput in) |
String |
toString() |
void |
writeExternal(ObjectOutput out) |
public FrontCodedRabaCoder()
public FrontCodedRabaCoder(int ratio)
ratio
- The ratio as defined by ByteArrayFrontCodedList
. For
front-coding, compression trades directly for search
performance. Every ratio byte[]s is fully coded. Binary search
is used on the fully coded byte[]s and will identify a bucket
ratio front-coded values. Linear search is then
performed within the bucket of front-coded values in which the
key would be found if it is present. Therefore the ratio is
also the maximum of steps in the linear scan.
Let m := n / ratio
, where n is the #of
entries in the byte[][]
(the size of the total
search problem), m is the size of the binary search
problem and ratio is the size of the linear search problem.
Solving for ratio, we have: ratio := n / m
. Some
examples:
m = n(64)/ratio(16) = 4 m = n(64)/ratio(8) = 8 m = n(64)/ratio(6) ˜ 11 m = n(64)/ratio(4) = 16
public final boolean isKeyCoder()
IRabaCoder
true
if this implementation can code B+Tree keys
(supports search on the coded representation). Note that some
implementations can code either keys or values.isKeyCoder
in interface IRabaCoder
public final boolean isValueCoder()
IRabaCoder
true
if this implementation can code B+Tree values
(allows null
s). Note that some implementations can code
either keys or values.isValueCoder
in interface IRabaCoder
public boolean isDuplicateKeys()
IRabaCoder
IRabaCoder
supports duplicate keys.isDuplicateKeys
in interface IRabaCoder
public void readExternal(ObjectInput in) throws IOException, ClassNotFoundException
readExternal
in interface Externalizable
IOException
ClassNotFoundException
public void writeExternal(ObjectOutput out) throws IOException
writeExternal
in interface Externalizable
IOException
public ICodedRaba encodeLive(IRaba raba, DataOutputBuffer buf)
IRabaCoder
ICodedRaba
. Implementations of this
method should be optimized for the very common use case where the caller
requires immediate access to the coded data record. In that case, many of
the IRabaCoder
implementations can be optimized by passing the
underlying decoding object directly into an alternative constructor for
the ICodedRaba
. The byte[] slice for the coded data record is
available from ICodedRaba.data()
.
This method covers the vast major of the use cases for coding data, which
is to code B+Tree keys or values for a node or leaf that has been evicted
from the AbstractBTree
's write retention queue. The common use
case is to wrap a coded record that was read from an IRawStore
.
The IndexSegmentBuilder
is a special case, since the coded record
will not be used other than to write it on the disk.
encodeLive
in interface IRabaCoder
public AbstractFixedByteArrayBuffer encode(IRaba raba, DataOutputBuffer buf)
IRabaCoder
Note: Implementations of this method are typically heavy. While it is
always valid to IRabaCoder.encode(IRaba, DataOutputBuffer)
an IRaba
, DO NOT invoke this arbitrarily on data which may already be
coded. The ICodedRaba
interface will always be implemented for
coded data.
encode
in interface IRabaCoder
raba
- The data.buf
- A buffer on which the coded data will be written.public ICodedRaba decode(AbstractFixedByteArrayBuffer data)
IRabaCoder
IRaba
which can access the coded data. In general,
implementations SHOULD NOT materialize a backing byte[][]. Instead, the
implementation should access the data in place within the caller's
buffer. Frequently used fields MAY be cached, but the whole point of the
IRabaCoder
is to minimize the in-memory footprint for the B+Tree
by using a coded (aka compressed) representation of the keys and values
whenever possible.decode
in interface IRabaCoder
data
- The record containing the coded data.Copyright © 2006–2019 SYSTAP, LLC DBA Blazegraph. All rights reserved.