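Added per-user model rate limiting. It is configured under System Settings - Rate Limit Settings,
with separate limits for the total number of requests and the number of successful requests.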
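
As a rough illustration of how two separate limits can interact, here is a minimal sketch of a sliding-window limiter. It is not the project's implementation: the class name `ModelRateLimiter`, its parameters, the 60-second default window, and the choice not to count rejected requests toward the total are all assumptions made for this example.

```python
import time
from collections import deque


class ModelRateLimiter:
    """Hypothetical sliding-window limiter with two caps per user:
    one on all admitted requests, one on successful requests."""

    def __init__(self, total_limit, success_limit, window_seconds=60.0,
                 clock=time.monotonic):
        self.total_limit = total_limit
        self.success_limit = success_limit
        self.window = window_seconds
        self.clock = clock  # injectable so the window can be tested without sleeping
        self._total = {}    # user_id -> timestamps of every admitted request
        self._success = {}  # user_id -> timestamps of successful requests

    def _prune(self, timestamps, now):
        # Drop entries that have fallen out of the sliding window.
        while timestamps and now - timestamps[0] >= self.window:
            timestamps.popleft()

    def allow(self, user_id):
        """Return True and count the request if both caps have room."""
        now = self.clock()
        total = self._total.setdefault(user_id, deque())
        success = self._success.setdefault(user_id, deque())
        self._prune(total, now)
        self._prune(success, now)
        if len(total) >= self.total_limit or len(success) >= self.success_limit:
            return False
        total.append(now)
        return True

    def record_success(self, user_id):
        """Call after the upstream model request completes successfully."""
        self._success.setdefault(user_id, deque()).append(self.clock())
```

In this sketch a caller checks `allow(user_id)` before forwarding a request and calls `record_success(user_id)` only when the upstream call succeeds, so failed requests consume the total budget but not the success budget.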

What's Changed
- feat: Add model request rate limiting functionality by @Calcium-Ion in #783
- feat: Pass extra_body in OpenAI request to the backend by @zeyugao in #781
New Contributors
Full Changelog: v0.4.8.3...v0.4.8.4