verticapy.read_avro#
- verticapy.read_avro(path: str, schema: str | None = None, table_name: str | None = None, usecols: list | None = None, new_name: dict | None = None, insert: bool = False, reject_on_materialized_type_error: bool = False, flatten_maps: bool = True, flatten_arrays: bool = False, temporary_table: bool = False, temporary_local_table: bool = True, gen_tmp_table_name: bool = True, ingest_local: bool = True, genSQL: bool = False, materialize: bool = True, use_complex_dt: bool = False) vDataFrame #
Ingests an AVRO file using flex tables.
Parameters#
- path: str
Absolute path where the AVRO file is located.
- schema: str, optional
Schema where the AVRO file will be ingested.
- table_name: str, optional
Final relation name.
- usecols: list, optional
list
of the AVRO parameters to ingest. The other ones will be ignored. If empty, all the AVRO parameters will be ingested.- new_name: dict, optional
Dictionary of the new column names. If the AVRO file is nested, it is recommended to change the final names because special characters will be included in the new column names. For example,
{"param": {"age": 3, "name": Badr}, "date": 1993-03-11}
will create 3 columns: “param.age”, “param.name” and “date”. You can rename these columns using thenew_name
parameter with the followingdictionary
:{"param.age": "age", "param.name": "name"}
- insert: bool, optional
If set to
True
, the data will be ingested to the input relation. The AVRO parameters must be the same as the input relation otherwise they will not be ingested. If set toTrue
,table_name
cannot be empty.- reject_on_materialized_type_error: bool, optional
boolean
, whether to reject a data row that contains a materialized column value that cannot be coerced into a compatible data type. If the value isFalse
and the type cannot be coerced, the parser sets the value in that column toNone
. If the column is a strongly-typed complex type, as opposed to a flexible complex type, then a type mismatch anywhere in the complex type causes the entire column to be treated as a mismatch. The parser does not partially load complex types.- flatten_maps: bool, optional
boolean
, whether to flatten sub-maps within the AVRO data, separating map levels with a period (.). This value affects all data in the load, including nested maps.- flatten_arrays: bool, optional
boolean
, whether to convert lists to sub-maps withinteger
keys. When lists are flattened, key names are concatenated in the same way as maps.lists
are not flattened by default. This value affects all data in the load, including nestedlists
.- temporary_table: bool, optional
If set to
True
, a temporary table will be created.- temporary_local_table: bool, optional
If set to
True
, a temporary local table will be created. The parameterschema
must be empty, otherwise this parameter is ignored.- gen_tmp_table_name: bool, optional
Sets the name of the temporary table. This parameter is only used when the parameter
temporary_local_table
is set toTrue
and if the parameterstable_name
andschema
are unspecified.- ingest_local: bool, optional
If set to
True
, the file will be ingested from the local machine.- genSQL: bool, optional
If set to
True
, the SQL code for creating the final table is generated but not executed. This is a good way to change the final relation types or to customize the data ingestion.- materialize: bool, optional
If set to
True
, the flex table is materialized into a table. Otherwise, it will remain a flex table. Flex tables simplify the data ingestion but have worse performace compared to regular tables.- use_complex_dt: bool, optional
boolean
, whether the input data file has complex structure. If set toTrue
, most of the other parameters are ignored.
Returns#
- vDataFrame
The
vDataFrame
of the relation.
Examples#
In this example, we will first download an AVRO file and then ingest it into Vertica database.
We import
verticapy
:import verticapy as vp
Hint
By assigning an alias to
verticapy
, we mitigate the risk of code collisions with other libraries. This precaution is necessary because verticapy uses commonly known function names like “average” and “median”, which can potentially lead to naming conflicts. The use of an alias ensures that the functions fromverticapy
are used as intended without interfering with functions from other libraries.Let’s download the AVRO file.
import requests url = "https://github.com/vertica/VerticaPy/raw/master/verticapy/tests/utilities/variants.avro" r = requests.get(url) open('variants.avro', 'wb').write(r.content) Out[5]: 1952604
Let’s ingest the AVRO file into the Vertica database.
from verticapy.core.parsers.avro import read_avro read_avro( path = "variants.avro", table_name = "variants", schema = "public", )
AbctypeVarchar(20)AbcsvVarchar(20)AbcLong varchar(162500)AbcstrandVarchar(20)123startIntegerAbcreferenceVarchar(32)AbcnamesLong varchar(40)123lengthIntegerAbcidVarchar(22)123endInteger123chromosomeIntegerAbcLong varchar(1155)AbcVarchar(20)123annotation.startIntegerAbcannotation.referenceVarchar(32)Abcannotation.populationFrequenciesLong varchar(40)Abcannotation.minorAlleleFreqVarchar(20)Abcannotation.minorAlleleVarchar(20)Abcannotation.idVarchar(20)Abcannotation.hgvsVarchar(20)Abcannotation.geneTraitAssociationLong varchar(40)Abcannotation.geneExpressionVarchar(20)Abcannotation.geneDrugInteractionLong varchar(40)AbcLong varchar(560)AbcVarchar(20)AbcLong varchar(800)AbcLong varchar(8090)123annotation.chromosomeIntegerAbcannotation.ancestralAlleleVarchar(20)Abcannotation.alternateVarchar(20)Abcannotation.additionalAttributesVarchar(20)Abcannotation.__name__Varchar(34)AbcalternateVarchar(20)Abc__name__Varchar(22)1 INDEL [null] + 16050740 A {} 1 rs587747231 16050740 22 16050740 A {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 2 INDEL [null] + 16051723 A {} 1 rs201906224 16051723 22 16051723 A {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 3 INDEL [null] + 16052395 AAAGCCAGAACCACTC {} 16 rs587774030 16052410 22 16052395 AAAGCCAGAACCACTC {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 4 INDEL [null] + 16055850 TT {} 2 rs587752360 16055851 22 16055850 TT {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 5 INDEL [null] + 16055901 T {} 1 rs587649799 16055901 22 16055901 T {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 6 INDEL [null] + 16056484 AG {} 2 rs587703083 16056485 22 16056484 AG {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 7 INDEL [null] + 16057193 A {} 1 rs587689210 16057193 22 16057193 A {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 8 INDEL [null] + 16061832 {} 1 rs587714792 16061832 22 16061832 {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 9 INDEL [null] + 16063429 AG {} 2 rs587680732 16063430 22 16063429 AG {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 10 INDEL [null] + 16063482 AA {} 2 rs587700504 16063483 22 16063482 AA {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 11 INDEL [null] + 16066472 C {} 1 rs587669040 16066472 22 16066472 C {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 12 INDEL [null] + 16070324 T {} 1 rs587727612 16070324 22 16070324 T {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 13 INDEL [null] + 16080425 TA {} 2 rs543349252 16080426 22 16080425 TA {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 14 INDEL [null] + 16085566 {} 4 rs561100161 16085569 22 16085566 {} [null] [null] [null] [null] {} [null] {} 22 [null] TTTC [null] VariantAnnotation TTTC VariantAvro 15 INDEL [null] + 16140743 TATC {} 4 rs577706315 16140746 22 16140743 TATC {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 16 INDEL [null] + 16141583 {} 1 rs545132695 16141583 22 16141583 {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 17 INDEL [null] + 16142235 CT {} 2 rs554362668 16142236 22 16142235 CT {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 18 INDEL [null] + 16145459 T {} 1 rs201309305 16145459 22 16145459 T {} [null] [null] [null] [null] {} [null] {} 22 [null] [null] VariantAnnotation VariantAvro 19 SNP [null] + 16050075 A {} 1 rs587697622 16050075 22 16050075 A {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 20 SNP [null] + 16050115 G {} 1 rs587755077 16050115 22 16050115 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 21 SNP [null] + 16050213 C {} 1 rs587654921 16050213 22 16050213 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 22 SNP [null] + 16050319 C {} 1 rs587712275 16050319 22 16050319 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 23 SNP [null] + 16050527 C {} 1 rs587769434 16050527 22 16050527 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 24 SNP [null] + 16050568 C {} 1 rs587638893 16050568 22 16050568 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 25 SNP [null] + 16050607 G {} 1 rs587720402 16050607 22 16050607 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 26 SNP [null] + 16050627 G {} 1 rs587593704 16050627 22 16050627 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 27 SNP [null] + 16050646 G {} 1 rs587670191 16050646 22 16050646 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 28 SNP [null] + 16050655 G {} 1 rs587703534 16050655 22 16050655 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 29 SNP [null] + 16050678 C {} 1 rs139377059 16050678 22 16050678 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 30 SNP [null] + 16050679 G {} 1 rs587682556 16050679 22 16050679 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 31 SNP [null] + 16050688 C {} 1 rs587756191 16050688 22 16050688 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 32 SNP [null] + 16050732 C {} 1 rs587652033 16050732 22 16050732 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 33 SNP [null] + 16050758 T {} 1 rs587684957 16050758 22 16050758 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 34 SNP [null] + 16050783 A {} 1 rs587743568 16050783 22 16050783 A {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 35 SNP [null] + 16050840 C {} 1 rs587616822 16050840 22 16050840 C {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 36 SNP [null] + 16050847 T {} 1 rs587702478 16050847 22 16050847 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 37 SNP [null] + 16050856 G {} 1 rs587754502 16050856 22 16050856 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 38 SNP [null] + 16050874 G {} 1 rs587634452 16050874 22 16050874 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 39 SNP [null] + 16050922 T {} 1 rs367963583 16050922 22 16050922 T {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 40 SNP [null] + 16050954 G {} 1 rs587763973 16050954 22 16050954 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 41 SNP [null] + 16050958 A {} 1 rs587636807 16050958 22 16050958 A {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 42 SNP [null] + 16050972 G {} 1 rs587709853 16050972 22 16050972 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 43 SNP [null] + 16050984 C {} 1 rs188945759 16050984 22 16050984 C {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 44 SNP [null] + 16050994 G {} 1 rs7288968 16050994 22 16050994 G {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 45 SNP [null] + 16050996 T {} 1 rs587706759 16050996 22 16050996 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 46 SNP [null] + 16051075 G {} 1 rs587625303 16051075 22 16051075 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 47 SNP [null] + 16051164 G {} 1 rs587698813 16051164 22 16051164 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 48 SNP [null] + 16051165 C {} 1 rs587731798 16051165 22 16051165 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 49 SNP [null] + 16051246 G {} 1 rs587627608 16051246 22 16051246 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 50 SNP [null] + 16051250 G {} 1 rs587742665 16051250 22 16051250 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 51 SNP [null] + 16051269 G {} 1 rs587623720 16051269 22 16051269 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 52 SNP [null] + 16051432 A {} 1 rs587672056 16051432 22 16051432 A {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 53 SNP [null] + 16051477 C {} 1 rs192339082 16051477 22 16051477 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 54 SNP [null] + 16051493 G {} 1 rs587740681 16051493 22 16051493 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 55 SNP [null] + 16051564 T {} 1 rs587614024 16051564 22 16051564 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 56 SNP [null] + 16051644 C {} 1 rs587696528 16051644 22 16051644 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 57 SNP [null] + 16051657 C {} 1 rs587748548 16051657 22 16051657 C {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 58 SNP [null] + 16051771 A {} 1 rs587650583 16051771 22 16051771 A {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 59 SNP [null] + 16051777 G {} 1 rs587678958 16051777 22 16051777 G {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 60 SNP [null] + 16051796 A {} 1 rs587772527 16051796 22 16051796 A {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 61 SNP [null] + 16051816 T {} 1 rs587674912 16051816 22 16051816 T {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 62 SNP [null] + 16051874 A {} 1 rs587731473 16051874 22 16051874 A {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 63 SNP [null] + 16051925 G {} 1 rs587631814 16051925 22 16051925 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 64 SNP [null] + 16051926 G {} 1 rs587639206 16051926 22 16051926 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 65 SNP [null] + 16051927 A {} 1 rs587724895 16051927 22 16051927 A {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 66 SNP [null] + 16051930 C {} 1 rs587599314 16051930 22 16051930 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 67 SNP [null] + 16051952 G {} 1 rs587672393 16051952 22 16051952 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 68 SNP [null] + 16052032 G {} 1 rs587723851 16052032 22 16052032 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 69 SNP [null] + 16052079 C {} 1 rs587605217 16052079 22 16052079 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 70 SNP [null] + 16052097 G {} 1 rs2844865 16052097 22 16052097 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 71 SNP [null] + 16052111 C {} 1 rs587618093 16052111 22 16052111 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 72 SNP [null] + 16052112 C {} 1 rs187181153 16052112 22 16052112 C {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 73 SNP [null] + 16052126 T {} 1 rs587729066 16052126 22 16052126 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 74 SNP [null] + 16052159 T {} 1 rs191584855 16052159 22 16052159 T {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 75 SNP [null] + 16052240 C {} 1 rs184458566 16052240 22 16052240 C {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 76 SNP [null] + 16052271 G {} 1 rs188996808 16052271 22 16052271 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 77 SNP [null] + 16052379 C {} 1 rs587674789 16052379 22 16052379 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 78 SNP [null] + 16052384 G {} 1 rs587713264 16052384 22 16052384 G {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 79 SNP [null] + 16052428 G {} 1 rs587776127 16052428 22 16052428 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 80 SNP [null] + 16052492 A {} 1 rs587719729 16052492 22 16052492 A {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 81 SNP [null] + 16052576 A {} 1 rs587596237 16052576 22 16052576 A {} [null] [null] [null] [null] {} [null] {} 22 [null] G [null] VariantAnnotation G VariantAvro 82 SNP [null] + 16052639 C {} 1 rs142442817 16052639 22 16052639 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 83 SNP [null] + 16052730 G {} 1 rs587605101 16052730 22 16052730 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 84 SNP [null] + 16052742 G {} 1 rs587665048 16052742 22 16052742 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 85 SNP [null] + 16052837 C {} 1 rs587743102 16052837 22 16052837 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 86 SNP [null] + 16052853 G {} 1 rs149990453 16052853 22 16052853 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 87 SNP [null] + 16052872 G {} 1 rs587672570 16052872 22 16052872 G {} [null] [null] [null] [null] {} [null] {} 22 [null] C [null] VariantAnnotation C VariantAvro 88 SNP [null] + 16052957 C {} 1 rs587740531 16052957 22 16052957 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 89 SNP [null] + 16052973 C {} 1 rs587668063 16052973 22 16052973 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 90 SNP [null] + 16053031 G {} 1 rs587654148 16053031 22 16053031 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 91 SNP [null] + 16053050 G {} 1 rs587711202 16053050 22 16053050 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 92 SNP [null] + 16053107 C {} 1 rs375566279 16053107 22 16053107 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 93 SNP [null] + 16053115 G {} 1 rs587610005 16053115 22 16053115 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 94 SNP [null] + 16053116 C {} 1 rs587692532 16053116 22 16053116 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 95 SNP [null] + 16053127 C {} 1 rs587744637 16053127 22 16053127 C {} [null] [null] [null] [null] {} [null] {} 22 [null] T [null] VariantAnnotation T VariantAvro 96 SNP [null] + 16053138 C {} 1 rs587646627 16053138 22 16053138 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 97 SNP [null] + 16053139 G {} 1 rs587710177 16053139 22 16053139 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 98 SNP [null] + 16053202 C {} 1 rs587766208 16053202 22 16053202 C {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 99 SNP [null] + 16053211 G {} 1 rs587640565 16053211 22 16053211 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro 100 SNP [null] + 16053238 G {} 1 rs587718616 16053238 22 16053238 G {} [null] [null] [null] [null] {} [null] {} 22 [null] A [null] VariantAnnotation A VariantAvro Rows: 1-100 | Columns: 34Let’s ingest only two columns.
read_avro( path = "variants.avro", table_name = "variants_usecols", schema = "public", usecols = [ "type", "sv", ], )
AbctypeVarchar(20)AbcsvVarchar(20)1 INDEL [null] 2 INDEL [null] 3 INDEL [null] 4 INDEL [null] 5 INDEL [null] 6 INDEL [null] 7 INDEL [null] 8 INDEL [null] 9 INDEL [null] 10 INDEL [null] 11 INDEL [null] 12 INDEL [null] 13 INDEL [null] 14 INDEL [null] 15 INDEL [null] 16 INDEL [null] 17 INDEL [null] 18 INDEL [null] 19 SNP [null] 20 SNP [null] 21 SNP [null] 22 SNP [null] 23 SNP [null] 24 SNP [null] 25 SNP [null] 26 SNP [null] 27 SNP [null] 28 SNP [null] 29 SNP [null] 30 SNP [null] 31 SNP [null] 32 SNP [null] 33 SNP [null] 34 SNP [null] 35 SNP [null] 36 SNP [null] 37 SNP [null] 38 SNP [null] 39 SNP [null] 40 SNP [null] 41 SNP [null] 42 SNP [null] 43 SNP [null] 44 SNP [null] 45 SNP [null] 46 SNP [null] 47 SNP [null] 48 SNP [null] 49 SNP [null] 50 SNP [null] 51 SNP [null] 52 SNP [null] 53 SNP [null] 54 SNP [null] 55 SNP [null] 56 SNP [null] 57 SNP [null] 58 SNP [null] 59 SNP [null] 60 SNP [null] 61 SNP [null] 62 SNP [null] 63 SNP [null] 64 SNP [null] 65 SNP [null] 66 SNP [null] 67 SNP [null] 68 SNP [null] 69 SNP [null] 70 SNP [null] 71 SNP [null] 72 SNP [null] 73 SNP [null] 74 SNP [null] 75 SNP [null] 76 SNP [null] 77 SNP [null] 78 SNP [null] 79 SNP [null] 80 SNP [null] 81 SNP [null] 82 SNP [null] 83 SNP [null] 84 SNP [null] 85 SNP [null] 86 SNP [null] 87 SNP [null] 88 SNP [null] 89 SNP [null] 90 SNP [null] 91 SNP [null] 92 SNP [null] 93 SNP [null] 94 SNP [null] 95 SNP [null] 96 SNP [null] 97 SNP [null] 98 SNP [null] 99 SNP [null] 100 SNP [null] Rows: 1-100 | Columns: 2Note
You can ingest multiple AVRO files into the Vertica database by using the following syntax.
read_avro( path = "*.avro", table_name = "variants_multi_files", schema = "public", )
See also
read_csv()
: Ingests a CSV file into the Vertica DB.read_file()
: Ingests an input file into the Vertica DB.read_json()
: Ingests a JSON file into the Vertica DB.read_pandas()
: Ingests thepandas.DataFrame
into the Vertica DB.