Albert Wibowo06/13/2023, 9:52 AM
My flyte workflow is quite simple and is as follow:
Matplotlib created a temporary config/cache directory at /tmp/matplotlib-dqrfqnc3 because the default path (/home/flytekit/.config/matplotlib) is not a writable directory; it is highly recommended to set the MPLCONFIGDIR environment variable to a writable directory, in particular to speed up the import of Matplotlib and to better support multiprocessing.
@task() def get_data() -> pd.DataFrame: data_dict = load_breast_cancer(as_frame=True) print(data_dict['data'].head()) return data_dict['data'] @task() def clean_data_pre_split(data:pd.DataFrame) -> pd.DataFrame: return data @workflow() def training_workflow(): data = get_data() data = clean_data_pre_split(data=data) return
Jay Ganbat06/13/2023, 3:44 PM