Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如何进行效率最大化或者效果降级呢 #48

Open
potatoker opened this issue Apr 26, 2024 · 1 comment
Open

如何进行效率最大化或者效果降级呢 #48

potatoker opened this issue Apr 26, 2024 · 1 comment

Comments

@potatoker
Copy link

Readme实时性的结论是在V100的机器上进行测试的,测试了一下实时体验确实已经是超出预期的好了!但这里想请教下,如果实时推理时我想要使用更低端的显卡,同时允许输出视频帧的效果有一定的降级,这里有哪些参数或者方法我可以参考的吗。

@vincentWuK
Copy link

禁用梯度累积,每次推理后manual 删除gpu 上的临时变量并且清理缓存,单线程处理,使用fp16

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants