Add simplified model manager install API to InvocationContext #6132

Open: lstein wants to merge 62 commits into main from lstein/feat/simple-mm2-api
Conversation


@lstein lstein commented Apr 4, 2024

Summary

This PR adds three model manager-related methods to the InvocationContext uniform API. They are accessible via context.models.*:

  1. load_local_model(model_path: Path, loader: Optional[Callable[[Path], AnyModel]] = None) -> LoadedModelWithoutConfig

Load the model located at the indicated path.

This will load a local model (.safetensors, .ckpt or diffusers directory) into the model manager RAM cache and return its LoadedModelWithoutConfig. If the optional loader argument is provided, the loader will be invoked to load the model into memory. Otherwise the method will call safetensors.torch.load_file(), torch.load() (with a pickle scan), or from_pretrained(), as appropriate to the path type.
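Conceptually, the default loader dispatch behaves like the sketch below (illustrative only, not the actual implementation):

import safetensors.torch
import torch
from pathlib import Path

def _default_load(path: Path):
    # Sketch of the dispatch described above (illustrative only).
    if path.suffix == '.safetensors':
        return safetensors.torch.load_file(path)
    elif path.is_dir():
        # A diffusers directory is resolved to the right pipeline class and
        # loaded via its from_pretrained() method (omitted here).
        ...
    else:
        # .ckpt/.pth checkpoints are pickle-scanned and then loaded.
        return torch.load(path, map_location='cpu')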

Be aware that the LoadedModelWithoutConfig object differs from LoadedModel by having no config attribute.

Here is an example of usage:

def invoke(self, context: InvocationContext) -> ImageOutput:
       model_path = Path('/opt/models/RealESRGAN_x4plus.pth')
       loadnet = context.models.load_local_model(model_path)
       with loadnet as loadnet_model:
             upscaler = RealESRGAN(loadnet=loadnet_model,...)
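
If the default loaders don't suit a model, you can supply your own loader callable. A minimal sketch (my_loader is a hypothetical example):

import torch
from pathlib import Path

def my_loader(path: Path):
    # Hypothetical custom loader: read the raw checkpoint onto the CPU.
    return torch.load(path, map_location='cpu')

model_path = Path('/opt/models/RealESRGAN_x4plus.pth')
loadnet = context.models.load_local_model(model_path, loader=my_loader)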

  2. load_remote_model(source: str | AnyHttpUrl, loader: Optional[Callable[[Path], AnyModel]] = None) -> LoadedModelWithoutConfig

Load the model located at the indicated URL or repo_id.

This is similar to load_local_model(), but it accepts either a HuggingFace repo_id (as a string) or a URL. The model's file(s) will be downloaded to models/.download_cache and then loaded, returning a LoadedModelWithoutConfig. Here is an example of usage:

def invoke(self, context: InvocationContext) -> ImageOutput:
       model_url = 'https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth'
       loadnet = context.models.load_remote_model(model_url)
       with loadnet as loadnet_model:
             upscaler = RealESRGAN(loadnet=loadnet_model,...)
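
Because a plain string is interpreted as a HuggingFace repo_id, a repo can be loaded the same way (the repo_id below is hypothetical):

loadnet = context.models.load_remote_model('some-org/some-model')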

  3. download_and_cache_model( source: str | AnyHttpUrl, access_token: Optional[str] = None, timeout: Optional[int] = 0) -> Path

Download the model file located at source to the models cache and return its Path. This checks models/.download_cache for the desired model file, downloads it from the indicated source if it is not already present, and then returns the local Path to the downloaded file.
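
A minimal usage sketch, reusing the URL from the example above:

def invoke(self, context: InvocationContext) -> ImageOutput:
       model_url = 'https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth'
       # Returns the local Path to the cached file, downloading it only on first use.
       model_path = context.models.download_and_cache_model(model_url)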


Other Changes

This PR performs a migration that renames models/.cache to models/.convert_cache and moves previously-downloaded ESRGAN, openpose, DepthAnything and Lama inpaint models from the models/core directory into models/.download_cache.

There are a number of legacy model files in models/core, such as GFPGAN, which are no longer used. This PR deletes them and tidies up the models/core directory.

Related Issues / Discussions

I have systematically replaced all the calls to download_with_progress_bar(). This function is no longer used elsewhere and has been removed.

QA Instructions

I have added unit tests for the three new calls. You may test that the load_remote_model() call is working by running the upscaler within the web app. On the first try, you will see the model file being downloaded into the models/.download_cache directory. On subsequent tries, the model will either load from RAM (if it hasn't been displaced) or be loaded from the filesystem.

Merge Plan

Squash merge when approved.

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)

@github-actions github-actions bot added python, services labels Apr 4, 2024
@lstein lstein force-pushed the lstein/feat/simple-mm2-api branch from 9cc1f20 to af1b57a on April 12, 2024 01:46
@github-actions github-actions bot added invocations, backend, python-tests labels Apr 12, 2024
@lstein lstein marked this pull request as ready for review April 12, 2024 05:17

lstein commented Apr 14, 2024

I have added a migration script that tidies up the models/core directory and removes unused models such as GFPGAN. In addition, I have renamed models/.cache to models/.convert_cache to distinguish it from the directory into which just-in-time models are downloaded, which is models/.download_cache. While the size of models/.convert_cache is capped so that less-used models are cleared periodically, files in models/.download_cache are not removed unless the user does so manually.

@lstein lstein force-pushed the lstein/feat/simple-mm2-api branch from 537a626 to 3ddd7ce on April 14, 2024 19:57
@lstein lstein force-pushed the lstein/feat/simple-mm2-api branch from 3ddd7ce to fa6efac on April 14, 2024 20:10

@psychedelicious psychedelicious left a comment


I'm not sure what I was expecting the implementation to be, but it definitely wasn't as simple as this - great work.

I've requested a few changes and there's one discussion item that I'd like to marinate on before we change the public invocation API.

invokeai/app/invocations/upscale.py (review comments resolved)
invokeai/app/services/shared/invocation_context.py (review comments resolved)
  • Set `self._context=context` instead of passing it as an arg
  • Just a bit of typo protection in lieu of full type safety for these methods, which is difficult due to the typing of `DownloadEventHandler`.
  • It's inherited from the ABC.
Comment on lines 104 to 112
def diffusers_load_directory(directory: Path) -> AnyModel:
load_class = GenericDiffusersLoader(
app_config=self._app_config,
logger=self._logger,
ram_cache=self._ram_cache,
convert_cache=self.convert_cache,
).get_hf_load_class(directory)
result: AnyModel = load_class.from_pretrained(model_path, torch_dtype=TorchDevice.choose_torch_dtype())
return result
Reviewer (psychedelicious):

This function is unused - I think the logic to get the loader should be checking if it's a directory? I'm not sure how to fix this myself because the diffusers_load_directory function has a different type signature than the other loader function options.

lstein (Author):

Fixed. Looks like I never wired up that function! In the process I discovered a long-standing bug that would prevent text encoders from being loaded generically.

I also added a unit test for loading generic diffusers models. To do this, I added the 10 MB "tiny" taesdxl model, which was the smallest loadable diffusers model I could find.

Reviewer (psychedelicious):

I don't think we should be adding models to the repo. Feels arbitrary to do it just for this one scenario and not all of them, plus 10MB increases the repo size by like 8%. Don't wanna get in the habit of doing this.

I think the best way to test model loading is to package up a models dir that has samples of various model types and configurations, then run full loading tests locally on one of our own machines or a custom CI runner.

lstein (Author):

I've commented out the unit test for directory loading.

Reviewer (psychedelicious):

Can you please drop the commits that added the model file so it's not in the repo?

@psychedelicious psychedelicious self-requested a review June 3, 2024 01:54

@psychedelicious psychedelicious left a comment


Sorry for the delay in reviewing. I've tidied a few things and tested everything, working great!

Two minor issues noted.


lstein commented Jun 4, 2024

@psychedelicious I've addressed the remaining issues you raised. Thanks for a thorough review.

Lincoln Stein added 2 commits June 3, 2024 20:33

lstein commented Jun 4, 2024

> I removed a number of unnecessary changes in invocation_context.py, mostly extraneous type annotations. If mypy is complaining about these, then that's a mypy problem, because all the methods are annotated correctly.
>
> I also moved load_model_from_url from the main model manager class into the invocation context.

Yes, mypy is having trouble tracking the return type of several methods. I haven't figured out what causes the problem and don't want to add a # type: ignore. But maybe I should 'cause I'm not ready to turn to pyright.

@psychedelicious

> Yes, mypy is having trouble tracking the return type of several methods. I haven't figured out what causes the problem and don't want to add a # type: ignore. But maybe I should 'cause I'm not ready to turn to pyright.

We shouldn't add # type: ignore, that will stop all type checkers from doing anything - including pyright. The places where you made code quality concessions to satisfy mypy involve very straightforward types - either your mypy config is borked or mypy itself is borked. FWIW, I've found pyright to be much faster, more thorough and more correct than mypy.

@psychedelicious

@RyanJDick Would you mind doing one last review of this PR?


lstein commented Jun 4, 2024

> > Yes, mypy is having trouble tracking the return type of several methods. I haven't figured out what causes the problem and don't want to add a # type: ignore. But maybe I should 'cause I'm not ready to turn to pyright.
>
> We shouldn't add # type: ignore, that will stop all type checkers from doing anything - including pyright. The places where you made code quality concessions to satisfy mypy involve very straightforward types - either your mypy config is borked or mypy itself is borked. FWIW, I've found pyright to be much faster, more thorough and more correct than mypy.

You've convinced me. I've switched to pyright!

@RyanJDick

> @RyanJDick Would you mind doing one last review of this PR?

Looks like 43/44 files have changed since I last looked 😅 . I'll plan to spend a chunk of time on this tomorrow.

@psychedelicious

@RyanJDick You can narrow that down to reviewing invocation_context.py, which changes the public API and is more important to get right the first time. Thanks.


@RyanJDick RyanJDick left a comment


I just reviewed the invocation_context.py API.

Comment on lines 61 to 70
"""
Build the migration from database version 9 to 10.

This migration does the following:
- Moves "core" models previously downloaded with download_with_progress_bar() into new
"models/.download_cache" directory.
- Renames "models/.cache" to "models/.convert_cache".
- Adds `error_type` and `error_message` columns to the session queue table.
- Renames the `error` column to `error_traceback`.
"""
Reviewer (RyanJDick):

This docstring is outdated.

lstein (Author):

Fixed.

Comment on lines 483 to 487
if isinstance(source, Path):
return self._services.model_manager.load.load_model_from_path(model_path=source, loader=loader)
else:
model_path = self._services.model_manager.install.download_and_cache_model(source=str(source))
return self._services.model_manager.load.load_model_from_path(model_path=model_path, loader=loader)
Reviewer (RyanJDick):

I don't love that we switch behaviour based on whether source is a Path or a str. It feels like a fragile distinction, especially given the popularity of using strs to represent paths in Python. The caller should always know whether they are dealing with a path or a URL/repo name, so I think it's better to make this distinction explicit.

In this discussion we had landed on an API that didn't require this type condition. Was there a reason for moving away from that?

lstein (Author):

It looks like @psychedelicious removed load_model_from_url() in commit b12444. Add it back?

Reviewer (psychedelicious):

I moved the method from the high-level services.model_manager class to the invocation context. The function is otherwise the same.

lstein (Author):

@psychedelicious I can't find load_model_from_url() in invocation_context.py, and the commit indicates that it was deleted from the model manager and that load_and_cache_model() was modified to no longer call it.

I’ll take care of fixing the API if the change wasn’t deliberate.

Reviewer (psychedelicious):

load_model_from_url() was this:

class ModelManagerService(ModelManagerServiceBase):
    # ...
    def load_model_from_url(
        self,
        source: str | AnyHttpUrl,
        loader: Optional[Callable[[Path], Dict[str, torch.Tensor]]] = None,
    ) -> LoadedModel:
        model_path = self.install.download_and_cache_model(source=str(source))
        return self.load.load_model_from_path(model_path=model_path, loader=loader)

It was called in one place in invocation_context.py:

class ModelsInterface(InvocationContextInterface):
    def load_and_cache_model(
        self,
        source: Path | str | AnyHttpUrl,
        loader: Optional[Callable[[Path], dict[str, Tensor]]] = None,
    ) -> LoadedModel:
        if isinstance(source, Path):
            return self._services.model_manager.load.load_model_from_path(model_path=source, loader=loader)
        else:
            # Called here
            return self._services.model_manager.load_model_from_url(source=source, loader=loader)

What I did was copy the two lines from the function body directly into ModelsInterface.load_and_cache_model() - I don't think we should have load_model_from_url on the main ModelManagerService.

I think the API Ryan is suggesting is to have load_and_cache_from_path and load_and_cache_from_url on ModelsInterface
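
For reference, a rough sketch of that suggested split, with method bodies lifted from the two branches shown above (the names load_and_cache_from_path / load_and_cache_from_url come from the suggestion, not from this PR):

class ModelsInterface(InvocationContextInterface):
    def load_and_cache_from_path(
        self,
        source: Path,
        loader: Optional[Callable[[Path], dict[str, Tensor]]] = None,
    ) -> LoadedModel:
        # Local path: load directly from disk.
        return self._services.model_manager.load.load_model_from_path(model_path=source, loader=loader)

    def load_and_cache_from_url(
        self,
        source: str | AnyHttpUrl,
        loader: Optional[Callable[[Path], dict[str, Tensor]]] = None,
    ) -> LoadedModel:
        # URL or repo_id: download into the models cache first, then load.
        model_path = self._services.model_manager.install.download_and_cache_model(source=str(source))
        return self._services.model_manager.load.load_model_from_path(model_path=model_path, loader=loader)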

lstein (Author):

For clarity, I have renamed the methods load_local_model() and load_remote_model(). The former accepts the Path to a file or directory, and the latter accepts either a direct download URL or a HuggingFace URL. I have fixed the documentation and updated the pull request description.

Reviewer:

Sounds good!

invokeai/app/services/shared/invocation_context.py (review comments resolved)
@lstein lstein requested a review from RyanJDick June 6, 2024 02:57

lstein commented Jun 6, 2024

@RyanJDick I've fixed the issues you identified.


@RyanJDick RyanJDick left a comment


invocation_context.py looks good to me ✅

I'll defer to @psychedelicious for final approval, since he has a more complete understanding of this PR than me at this point.

Labels: api, backend, docs, invocations, python, python-tests, services
Development

Successfully merging this pull request may close these issues.

[bug]: CUDA out of memory error when upscaling x4 (or x2 twice)