Ryuu
12/28/2023, 2:11 AMflytekit>=1.5.0
pandas~=1.5.3
scikit-learn
requests~=2.31.0
numpy~=1.24.4
torch~=2.1.1
albumentations~=1.3.1
torchvision~=0.16.1
boto3~=1.28.64
dataclasses~=0.6
pillow~=10.1.0
deeplake~=3.8.9
matplotlib~=3.7.3
tensorboardx~=2.6.2.2
tqdm~=4.66.1
I have run my project locally successfully. The process registration didn't log any information about this for me to trace the problem.L godlike
12/28/2023, 2:29 AML godlike
12/28/2023, 2:29 AMpyflyte run --remote
?Ryuu
12/28/2023, 3:15 AMRyuu
12/28/2023, 3:15 AMRyuu
12/28/2023, 3:16 AMRyuu
12/28/2023, 3:20 AML godlike
12/28/2023, 3:33 AMYee
I have test successfuly register and run workflow with sandbox, this problem is on Flyte binary
Yee
Ryuu
12/29/2023, 1:21 AMYee
Yee
Ryuu
12/29/2023, 1:36 AMYee
Ryuu
12/29/2023, 1:45 AMYee
Yee
Yee
export FLYTE_SDK_LOGGING_LEVEL=10
and run the register command again?Ryuu
12/29/2023, 2:11 AMadmin@admin:/mnt/data1/hainq/flyte-train$ export FLYTE_SDK_LOGGING_LEVEL=10
admin@admin:/mnt/data1/hainq/flyte-train$ pyflyte register ./
2023-12-29 09:08:55,633908 INFO {"asctime": "2023-12-29 09:08:55,633", "name": "flytekit", "levelname": "INFO", "message": "Using flytectl/YAML file.py:272
config /home/aiteam/.flyte/config.yaml"}
2023-12-29 09:08:55,639510 DEBUG {"asctime": "2023-12-29 09:08:55,639", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.endpoint could not be found in yaml config"}
2023-12-29 09:08:55,641170 DEBUG {"asctime": "2023-12-29 09:08:55,641", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.access-key could not be found in yaml config"}
2023-12-29 09:08:55,642768 DEBUG {"asctime": "2023-12-29 09:08:55,642", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.secret-key could not be found in yaml config"}
2023-12-29 09:08:55,668514 INFO {"asctime": "2023-12-29 09:08:55,668", "name": "flytekit", "levelname": "INFO", "message": "Using flytectl/YAML file.py:272
config /home/aiteam/.flyte/config.yaml"}
2023-12-29 09:08:55,671290 DEBUG {"asctime": "2023-12-29 09:08:55,671", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.insecureSkipVerify could not be found in yaml config"}
2023-12-29 09:08:55,674539 DEBUG {"asctime": "2023-12-29 09:08:55,674", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.caCertFilePath could not be found in yaml config"}
2023-12-29 09:08:55,676770 DEBUG {"asctime": "2023-12-29 09:08:55,676", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.command could not be found in yaml config"}
2023-12-29 09:08:55,678432 DEBUG {"asctime": "2023-12-29 09:08:55,678", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.clientId could not be found in yaml config"}
2023-12-29 09:08:55,680174 DEBUG {"asctime": "2023-12-29 09:08:55,680", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.clientSecretLocation could not be found in yaml config"}
2023-12-29 09:08:55,681888 DEBUG {"asctime": "2023-12-29 09:08:55,681", "name": "flytekit", "levelname": "DEBUG", "message": "Switch admin.scopes file.py:222
could not be found in yaml config"}
2023-12-29 09:08:55,683648 DEBUG {"asctime": "2023-12-29 09:08:55,683", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
console.endpoint could not be found in yaml config"}
2023-12-29 09:08:55,685305 DEBUG {"asctime": "2023-12-29 09:08:55,685", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.httpProxyURL could not be found in yaml config"}
2023-12-29 09:08:55,687107 DEBUG {"asctime": "2023-12-29 09:08:55,687", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.endpoint could not be found in yaml config"}
2023-12-29 09:08:55,688733 DEBUG {"asctime": "2023-12-29 09:08:55,688", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.access-key could not be found in yaml config"}
2023-12-29 09:08:55,690359 DEBUG {"asctime": "2023-12-29 09:08:55,690", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.secret-key could not be found in yaml config"}
2023-12-29 09:08:55,692297 INFO {"asctime": "2023-12-29 09:08:55,692", "name": "flytekit", "levelname": "INFO", "message": "Using flytectl/YAML file.py:272
config /home/aiteam/.flyte/config.yaml"}
2023-12-29 09:08:55,774191 INFO {"asctime": "2023-12-29 09:08:55,774", "name": "flytekit", "levelname": "INFO", "message": "Registering an base_agent.py:122
agent for task type sensor"}
Running pyflyte register from /mnt/data1/hainq/flyte-train with images ImageConfig(default_image=Image(name='default', fqn='cr.flyte.org/flyteorg/flytekit', tag='py3.8-1.9.1'), images=[Image(name='default', fqn='cr.flyte.org/flyteorg/flytekit', tag='py3.8-1.9.1')]) and image destination folder /root on 1 package(s) ('/mnt/data1/hainq/flyte-train',)
2023-12-29 09:08:56,061509 INFO {"asctime": "2023-12-29 09:08:56,061", "name": "flytekit", "levelname": "INFO", "message": "Using flytectl/YAML file.py:272
config /home/aiteam/.flyte/config.yaml"}
2023-12-29 09:08:56,064550 INFO {"asctime": "2023-12-29 09:08:56,064", "name": "flytekit", "levelname": "INFO", "message": "Using flytectl/YAML file.py:272
config /home/aiteam/.flyte/config.yaml"}
2023-12-29 09:08:56,066876 DEBUG {"asctime": "2023-12-29 09:08:56,066", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.insecureSkipVerify could not be found in yaml config"}
2023-12-29 09:08:56,068563 DEBUG {"asctime": "2023-12-29 09:08:56,068", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.caCertFilePath could not be found in yaml config"}
2023-12-29 09:08:56,070225 DEBUG {"asctime": "2023-12-29 09:08:56,070", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.command could not be found in yaml config"}
2023-12-29 09:08:56,071906 DEBUG {"asctime": "2023-12-29 09:08:56,071", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.clientId could not be found in yaml config"}
2023-12-29 09:08:56,073529 DEBUG {"asctime": "2023-12-29 09:08:56,073", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.clientSecretLocation could not be found in yaml config"}
2023-12-29 09:08:56,075053 DEBUG {"asctime": "2023-12-29 09:08:56,075", "name": "flytekit", "levelname": "DEBUG", "message": "Switch admin.scopes file.py:222
could not be found in yaml config"}
2023-12-29 09:08:56,076688 DEBUG {"asctime": "2023-12-29 09:08:56,076", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
console.endpoint could not be found in yaml config"}
2023-12-29 09:08:56,078326 DEBUG {"asctime": "2023-12-29 09:08:56,078", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
admin.httpProxyURL could not be found in yaml config"}
2023-12-29 09:08:56,080050 DEBUG {"asctime": "2023-12-29 09:08:56,080", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.endpoint could not be found in yaml config"}
2023-12-29 09:08:56,081634 DEBUG {"asctime": "2023-12-29 09:08:56,081", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.access-key could not be found in yaml config"}
2023-12-29 09:08:56,083435 DEBUG {"asctime": "2023-12-29 09:08:56,083", "name": "flytekit", "levelname": "DEBUG", "message": "Switch file.py:222
storage.connection.secret-key could not be found in yaml config"}
Registering against 192.168.1.205:30081
2023-12-29 09:08:56,235803 DEBUG {"asctime": "2023-12-29 09:08:56,235", "name": "flytekit", "levelname": "DEBUG", "message": "Common root folder repo.py:141
detected as /mnt/data1/hainq"}
Detected Root /mnt/data1/hainq, using this to create deployable package...
2023-12-29 09:08:56,245038 WARNING {"asctime": "2023-12-29 09:08:56,245", "name": "flytekit.cli", "levelname": "WARNING", "message": "Could not ignore.py:51
determine ignored files due to:\nb'fatal: not a git repository (or any parent up to mount point
/mnt)\\nStopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).\\n'\nNot applying any
filters"}
Yee
Yee
Yee
Ryuu
12/29/2023, 2:58 AMRyuu
12/29/2023, 2:59 AMRyuu
12/29/2023, 3:00 AMRyuu
12/29/2023, 3:00 AMYee
/home/aiteam/.flyte/config.yaml
?Yee
Yee
Yee
Yee
Yee
Ryuu
12/29/2023, 3:47 AMRyuu
12/29/2023, 3:49 AMRyuu
12/29/2023, 3:50 AMRyuu
12/29/2023, 3:51 AMYee
which python
?Ryuu
12/29/2023, 7:28 AM$ python --version
Python 3.8.17
$ which python
/home/aiteam/anaconda3/envs/hainq/bin/python
Yee
/home/aiteam/anaconda3/envs/hainq/lib/site-packages/…
Ryuu
12/29/2023, 8:32 AMRyuu
12/29/2023, 10:00 AM# Here is the fuction i have chaged to debug, with the same package code, but i in my flyte binary react 300000 while on other sandbox is just 9000=> Very weird here
def compute_digest(source: os.PathLike, filter: Optional[callable] = None) -> str:
"""
Walks the entirety of the source dir to compute a deterministic md5 hex digest of the dir contents.
:param os.PathLike source:
:param Ignore ignore:
:return Text:
"""
hasher = hashlib.md5()
i = 1
for root, _, files in os.walk(source, topdown=True):
files.sort()
for fname in files:
print(i)
i+=1
abspath = os.path.join(root, fname)
relpath = os.path.relpath(abspath, source)
if filter:
if filter(relpath):
continue
_filehash_update(abspath, hasher)
_pathhash_update(relpath, hasher)
print('1')
print(hasher.hexdigest())
return hasher.hexdigest()
Ryuu
12/29/2023, 10:01 AMYee
Yee
i
?Yee
Yee
Ryuu
12/29/2023, 10:47 AMRyuu
12/29/2023, 10:48 AMRyuu
12/29/2023, 10:55 AMpyflyte register ./
##or
pyflyte reigster /mnt/data1/hainq/flyte-train
## both of above command do registration from root which contain a lot of other out space file => broken because total size for hashing is too large, all file from root /mnt/root/ ....
It's do register from root, I have change to use
pyflyte register train_workflows ## train_workflow is the sub-folder in flyte-train folder
## this script make the registration with train_workflow folder only
Ryuu
12/29/2023, 10:55 AMYee
Yee
Yee
Yee
Ryuu
12/29/2023, 10:57 AMRyuu
12/29/2023, 10:57 AMRyuu
12/29/2023, 10:57 AMRyuu
12/29/2023, 10:57 AMYee
pwd
?Ryuu
12/29/2023, 10:58 AMRyuu
12/29/2023, 10:58 AMRyuu
12/29/2023, 11:07 AMpyflyte register ./
pyflyte reigster /mnt/data1/hainq/flyte-train
This two 2 bad, both of 2 is register from /mnt/rootYee
./train_workflows
?Ryuu
12/29/2023, 11:09 AMYee
Yee
Ryuu
12/29/2023, 11:12 AMYee
Ryuu
12/29/2023, 11:14 AMYee