glamorous-judge-19392
01/29/2025, 7:22 AM
freezing-airport-6809
glamorous-judge-19392
01/29/2025, 3:23 PM
bland-printer-19092
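02/20/2025, 2:03 PM
from flytekit import task
from flytekit.types.structured import StructuredDataset
from pyspark.sql import DataFrame  # assumed: the thread reads with Spark

@task
def sd_creator() -> StructuredDataset:
    # Point the dataset at existing JSON files rather than a dataframe.
    return StructuredDataset(uri="s3a://my_bucket/path/to/json/")

@task
def sd_worker(sd_json: StructuredDataset):
    df = sd_json.open(DataFrame).all()
    ...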
as it still attempts to read the data as Parquet (even when passing file_format="json" to StructuredDataset). Do you have a working example for this? I did get it to work by reading explicitly with Spark and returning a StructuredDataset without a uri, but that adds overhead.