Something went wrong in training time

Related to: AI Unlimited
Date: 03/01/2023 09:40
User: Joe Jackson
The training time step doesn't complete when I run it. I get "Something went wrong."

Here is the resulting output:

Traceback (most recent call last):
  File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 852, in <module>
    main()
  File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 512, in main
    tokenizer = CLIPTokenizer.from_pretrained(args.pretrained_model_name_or_path, subfolder="tokenizer")
  File "/usr/local/lib/python3.8/dist-packages/transformers/tokenization_utils_base.py", line 1736, in from_pretrained
    resolved_vocab_files[file_id] = cached_file(
  File "/usr/local/lib/python3.8/dist-packages/transformers/utils/hub.py", line 409, in cached_file
    resolved_file = hf_hub_download(
  File "/usr/local/lib/python3.8/dist-packages/huggingface_hub/file_download.py", line 1022, in hf_hub_download
    cache_dir, repo_folder_name(repo_id=repo_id, repo_type=repo_type)
  File "/usr/local/lib/python3.8/dist-packages/huggingface_hub/utils/_validators.py", line 92, in _inner_fn
    validate_repo_id(arg_value)
  File "/usr/local/lib/python3.8/dist-packages/huggingface_hub/utils/_validators.py", line 136, in validate_repo_id
    raise HFValidationError(
huggingface_hub.utils._validators.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/content/stable-diffusion-v1-5'. Use `repo_type` argument if needed.
Traceback (most recent call last):
  File "/usr/local/bin/accelerate", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.8/dist-packages/accelerate/commands/accelerate_cli.py", line 43, in main
    args.func(args)
  File "/usr/local/lib/python3.8/dist-packages/accelerate/commands/launch.py", line 837, in launch_command
    simple_launcher(args)
  File "/usr/local/lib/python3.8/dist-packages/accelerate/commands/launch.py", line 354, in simple_launcher
    raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/usr/bin/python3', '/content/diffusers/examples/dreambooth/train_dreambooth.py', '--save_starting_step=500', '--save_n_steps=0', '--train_text_encoder', '--pretrained_model_name_or_path=/content/stable-diffusion-v1-5', '--instance_data_dir=/content/data/joespirit', '--class_data_dir=/content/regularization_images/person_ddim', '--output_dir=/content/models/joespirit', '--with_prior_preservation', '--prior_loss_weight=1.0', '--instance_prompt=photo of joespirit person', '--class_prompt=a photo of a person, ultra detailed', '--seed=75576', '--resolution=512', '--mixed_precision=fp16', '--train_batch_size=1', '--gradient_accumulation_steps=1', '--gradient_checkpointing', '--use_8bit_adam', '--learning_rate=1e-6', '--lr_scheduler=constant', '--lr_warmup_steps=0', '--center_crop', '--max_train_steps=1600', '--num_class_images=500']' returned non-zero exit status 1.
Something went wrong

Answers

Date: 03/01/2023 09:40
User: Jorge Vila
Approved Answer
Hi Joe,
Before starting the training, make sure the folder /SD is deleted.
Then try again, and let me know at jorge@jorgevila.com if you have any trouble.
Cheers,
Jorge
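
As a rough aid for the retry, here is a minimal Python sketch of two pre-flight checks. It assumes the HFValidationError in the traceback means that /content/stable-diffusion-v1-5 was not found as a local directory when the script started, so the value was treated as a Hub repo id instead. That is the usual cause of this message, but it is not confirmed for this run. The helper names remove_folder_if_present and missing_model_parts are hypothetical, and the "tokenizer" subfolder is taken from the from_pretrained call shown in the traceback.

```python
import shutil
from pathlib import Path


def remove_folder_if_present(folder):
    """Delete `folder` and everything inside it.

    Returns True if a folder was removed, False if there was nothing to remove.
    """
    path = Path(folder)
    if path.is_dir():
        shutil.rmtree(path)
        return True
    return False


def missing_model_parts(model_dir, required=("tokenizer",)):
    """List what is missing from a local model directory.

    If `model_dir` itself is not a directory, the result contains only that
    path. Otherwise it contains each required subfolder that is absent.
    An empty list means every checked entry was found.
    """
    root = Path(model_dir)
    if not root.is_dir():
        return [str(root)]
    return [name for name in required if not (root / name).is_dir()]
```

For example, calling remove_folder_if_present("/SD") and then missing_model_parts("/content/stable-diffusion-v1-5") before starting the training shows whether the model folder is in place. A non-empty result suggests the model files need to be downloaded again before the training step can load them.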


Copyright jorgevila.com 2024
