acoustic-carpenter-78188
07/16/2023, 11:13 AMpandas.DataFrame
or arrow.Tables
, to duckdb transparently.
@task
def t1() -> Annotated[StructuredDataset, kwtypes(len=int)]:
df = pd.DataFrame({"name": ["dylan", "steve"],"age": [33, 32]})
return StructuredDataset(df, uri=duckdb_uri) # flytekit will write pandas dataframe to duckdb
Goal: What should the final outcome look like, ideally?
Add a duckDB plugin in flytekit, and add a structured dataset encoder and decoder.
Here is an example to add custom encoder/decoder in flytekit-plugin
Describe alternatives you've considered
No response
Propose: Link/Inline OR Additional context
No response
Are you sure this issue hasn't been raised already?
☑︎ Yes
Have you read the Code of Conduct?
☑︎ Yes
flyteorg/flyteacoustic-carpenter-78188
07/16/2023, 11:13 AM