cold-teacher-46250
07/16/2025, 8:31 PM
jolly-florist-66728
07/16/2025, 8:31 PM
cold-teacher-46250
07/16/2025, 8:54 PM
jolly-florist-66728
07/16/2025, 8:54 PM
cold-teacher-46250
07/18/2025, 7:28 PM
jolly-florist-66728
07/18/2025, 7:29 PM
cold-teacher-46250
07/18/2025, 7:32 PM
def list_raw_files(dates_to_rerun: Optional[List[str]] = None) -> List[FileInfo]:
and also
def process_files_in_batches(databricks_files: List[FileInfo]) -> List[FileInfo]:
jolly-florist-66728
07/18/2025, 7:32 PM
cold-teacher-46250
07/18/2025, 7:33 PM
FileInfo(namedtuple("FileInfo", ["path", "name", "size", "modificationTime"]))
jolly-florist-66728
07/18/2025, 7:33 PM
cold-teacher-46250
07/18/2025, 7:34 PM
jolly-florist-66728
07/18/2025, 7:34 PM
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class FileInfo:
    path: str
    name: str
    size: int
    modificationTime: str

def list_raw_files(dates_to_rerun: Optional[List[str]] = None) -> List[FileInfo]:
    # Example implementation
    return [FileInfo(path="/tmp/a", name="a", size=123, modificationTime="2024-06-01")]

def process_files_in_batches(databricks_files: List[FileInfo]) -> List[FileInfo]:
    # Example implementation
    return databricks_files
This dataclass approach is fully supported for passing lists of FileInfo between Flyte tasks (source). Would you like more detail or a Flyte task/workflow example?
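If the files start out as the namedtuple-based FileInfo quoted earlier in the thread, they would need to be copied into the dataclass before being returned. A minimal sketch of that conversion, using only the standard library: `RawFileInfo`, `to_dataclass`, and `convert_all` are hypothetical names introduced here for illustration, and the sample record is made up. It assumes the namedtuple fields map one-to-one onto the dataclass fields.

```python
from collections import namedtuple
from dataclasses import dataclass
from typing import List

# Hypothetical stand-in for the namedtuple-based FileInfo from the thread.
RawFileInfo = namedtuple("RawFileInfo", ["path", "name", "size", "modificationTime"])


@dataclass
class FileInfo:
    path: str
    name: str
    size: int
    modificationTime: str


def to_dataclass(raw: RawFileInfo) -> FileInfo:
    """Copy one namedtuple record into the dataclass, field by field."""
    fields = raw._asdict()
    # The dataclass declares modificationTime as str, so coerce it here
    # in case the source value is numeric (assumption for this sketch).
    fields["modificationTime"] = str(fields["modificationTime"])
    return FileInfo(**fields)


def convert_all(raw_files: List[RawFileInfo]) -> List[FileInfo]:
    """Convert a list of namedtuple records into a list of dataclasses."""
    return [to_dataclass(raw) for raw in raw_files]


# Made-up sample record, for illustration only.
converted = convert_all([RawFileInfo("/tmp/a", "a", 123, 1717200000000)])
```

Doing the conversion inside `list_raw_files` would keep the namedtuple out of the function signatures, so both functions could keep the `List[FileInfo]` annotations shown above.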
Sources:
- https://www.union.ai/docs/flyte/user-guide/data-input-output/dataclass/
hallowed-toothbrush-42565
07/23/2025, 8:44 AM
cold-teacher-46250
07/23/2025, 4:48 PM