API Reference

intake_avro.source.AvroTableSource(urlpath) Source to load tabular avro datasets.
intake_avro.source.AvroSequenceSource(urlpath) Source to load avro datasets as sequence of python dicts.
class intake_avro.source.AvroTableSource(urlpath, metadata=None, storage_options=None)[source]

Source to load tabular avro datasets.

Parameters:
urlpath: str

Location of the data files; can include protocol and glob characters.

Attributes:
datashape
description
hvplot

Returns a hvPlot object to provide a high-level plotting API.

name
plot

Returns a hvPlot object to provide a high-level plotting API.

Methods

close() Close open resources corresponding to this data source.
discover() Open resource and populate the source attributes.
read() Load entire dataset into a container and return it
read_chunked() Return iterator over container fragments of data source
read_partition(i) Return a (offset_tuple, container) corresponding to i-th partition.
to_dask() Create lazy dask dataframe object
yaml() Return YAML representation of this data-source
read()[source]

Load entire dataset into a container and return it

to_dask()[source]

Create lazy dask dataframe object

class intake_avro.source.AvroSequenceSource(urlpath, metadata=None, storage_options=None)[source]

Source to load avro datasets as sequence of python dicts.

Parameters:
urlpath: str

Location of the data files; can include protocol and glob characters.

Attributes:
datashape
description
hvplot

Returns a hvPlot object to provide a high-level plotting API.

name
plot

Returns a hvPlot object to provide a high-level plotting API.

Methods

close() Close open resources corresponding to this data source.
discover() Open resource and populate the source attributes.
read() Load entire dataset into a container and return it
read_chunked() Return iterator over container fragments of data source
read_partition(i) Return a (offset_tuple, container) corresponding to i-th partition.
to_dask() Create lazy dask bag object
yaml() Return YAML representation of this data-source
read()[source]

Load entire dataset into a container and return it

to_dask()[source]

Create lazy dask bag object