What's Changed
- [Eval,Arch] Update GPQA eval and add `headless_mode` for Controller by @xingyaoww in
- [eval,fix]: metrics get carried across eval instances by @xingyaoww in #3072
- [Eval] Support SWE-Bench pull from custom docker namespace by @xingyaoww in #3136
- [Arch] Shrink runtime image size by @xingyaoww in #3051
- [Arch] Add runtime image build CI & clean up runtime build using `jinja2` template by @xingyaoww in #3055
- CI: Force stop colima by @SmartManoj in #3053
- [FIX] Update SWEBenchSSHBox after global config was removed from sandbox. by @RajWorking in
- fix: add llm `drop_params` parameter to LLMConfig by @tobitege in #2471
- (fix) colima: use a docker context specific to runner; prevent duplicate start by @tobitege in #3097
- Bug fix: Metrics not accumulated across agent delegation by @li-boxuan in #3012
- fix: Remove extra arg from swebench ssh box by @xingyaoww in #3054
- (fix) fixed ghcr_push use of image name by @tobitege in #3101
- (fix) Runtime yml missing zip handling (fixes #3101) by @tobitege in #3104
- (fix) ghcr-runtime: no unzip, artifact downloads as-is (followup to #3104) by @tobitege in #3105
- fix: make max_budget_per_task optional in `run_agent_controller` by @xingyaoww in #3071
- Fix: revert torch version by @yufansong in #3118
- (fix) test_runtime: run tests per runtime, not alternating by @tobitege in #3103
- (fix) colima: fix return code handling (followup to #3097) by @tobitege in #3106
- fix (ghcr push): add missing extension by @xingyaoww in #3120
- fix (ghcr-runtime): fix filename for docker image tar by @xingyaoww in #3121
- (fix) Fix DummyAgent (used in E2E test) by @tobitege in #3137
- Fix(test,CI): runtime build tests by @xingyaoww in #3126
- [Docs] fixed broken shell command by @tolik518 in #3135
- refactor: rename 'changeAgentState' Issue#2977 by @DecodersLord in #3050
- (test|refactor)(frontend): Refactor and test the `FileIcon` component by @amanape in #3108
- Remove monologue agent by @neubig in #3036
#3014
- Remove config from files by @neubig in #3039
#2994
- Update Dockerfile casing by @charliez0 in #3045
- Remove global config from tests by @neubig in #3052
- Removed config from agent controller by @neubig in #3038
- Validate to_replace in edit_file_by_replace AgentSkill by @li-boxuan in #3073
- Modify codeAct paper link by @linshaoxin-maker in #3076
- Change doc title of agent hub by @neubig in #3100
- Update paper link in README.md by @xingyaoww in #3102
- Remove remaining global config by @neubig in #3099
- Always log user messages by @enyst in #3145
- chore-icon-transparency by @tofarr in #3138
- chore: Release 0.8.1 by @mamoodi in #3035
- chore(deps): bump @nextui-org/react from 2.4.3 to 2.4.5 in /frontend by @dependabot in #3021
- chore(deps-dev): bump openai from 1.35.13 to 1.36.0 by @dependabot in #3033
- chore(deps): bump uvicorn from 0.30.1 to 0.30.3 by @dependabot in #3062
- chore(deps-dev): bump mypy from 1.10.1 to 1.11.0 by @dependabot in #3066
- chore(deps): bump react-use from 17.5.0 to 17.5.1 in /docs by @dependabot in #3063
- chore(deps-dev): bump eslint-plugin-react from 7.34.4 to 7.35.0 in /frontend by @dependabot in #3060
- chore(deps-dev): bump jsdom from 24.1.0 to 24.1.1 in /frontend by @dependabot in #3057
- chore(deps-dev): bump openai from 1.36.0 to 1.36.1 by @dependabot in #3069
- chore(deps): bump litellm from 1.41.24 to 1.41.25 by @dependabot in #3064
- chore(deps-dev): bump pytest from 8.2.2 to 8.3.1 by @dependabot in #3065
- chore(deps-dev): bump ruff from 0.5.3 to 0.5.4 by @dependabot in #3068
- chore(deps): bump @nextui-org/react from 2.4.5 to 2.4.6 in /frontend by @dependabot in #3059
- chore(deps-dev): bump typescript from 5.5.3 to 5.5.4 in /docs by @dependabot in #3079
- chore(deps-dev): bump @typescript-eslint/parser from 7.16.1 to 7.17.0 in /frontend by @dependabot in #3080
- chore(deps-dev): bump openai from 1.36.1 to 1.37.0 by @dependabot in #3088
- chore(deps): bump litellm from 1.41.25 to 1.41.27 by @dependabot in #3086
- chore(deps-dev): bump typescript from 5.5.3 to 5.5.4 in /frontend by @dependabot in #3084
- chore(deps-dev): bump @testing-library/jest-dom from 6.4.6 to 6.4.8 in /frontend by @dependabot in #3083
- chore(deps-dev): bump @typescript-eslint/eslint-plugin from 7.16.1 to 7.17.0 in /frontend by @dependabot in #3081
- chore(deps): bump boto3 from 1.34.145 to 1.34.146 by @dependabot in #3087
- chore(deps-dev): bump chromadb from 0.5.4 to 0.5.5 by @dependabot in #3085
- chore(deps): bump litellm from 1.41.27 to 1.41.28 by @dependabot in #3092
- chore(deps): bump boto3 from 1.34.146 to 1.34.147 by @dependabot in #3093
- chore(deps): bump @react-types/shared from 3.24.0 to 3.24.1 in /frontend by @dependabot in #3094
- chore(deps-dev): bump @types/node from 20.14.11 to 20.14.12 in /frontend by @dependabot in #3095
- chore(deps): bump litellm from 1.41.28 to 1.42.1 by @dependabot in #3109
- chore(deps-dev): bump torch from 2.2.2 to 2.4.0 by @dependabot in #3110
- chore(deps): bump google-cloud-aiplatform from 1.59.0 to 1.60.0 by @dependabot in #3111
- chore(deps): bump boto3 from 1.34.147 to 1.34.148 by @dependabot in #3112
- chore(deps-dev): bump pytest from 8.3.1 to 8.3.2 by @dependabot in #3113
- chore(deps-dev): bump postcss from 8.4.39 to 8.4.40 in /frontend by @dependabot in #3114
- chore(deps): bump vite from 5.3.4 to 5.3.5 in /frontend by @dependabot in #3115
- chore(deps-dev): bump tailwindcss from 3.4.6 to 3.4.7 in /frontend by @dependabot in #3116
- chore(deps-dev): bump husky from 9.1.1 to 9.1.2 in /frontend by @dependabot in #3117
- chore(deps): bump litellm from 1.42.1 to 1.42.3 by @dependabot in #3131
- chore(deps-dev): bump ruff from 0.5.4 to 0.5.5 by @dependabot in #3132
- chore(deps-dev): bump streamlit from 1.36.0 to 1.37.0 by @dependabot in #3129
- chore(deps): bump boto3 from 1.34.148 to 1.34.149 by @dependabot in #3133
- chore(deps-dev): bump openai from 1.37.0 to 1.37.1 by @dependabot in #3134
- chore(deps): bump @react-types/shared from 3.23.1 to 3.24.0 in /frontend by @dependabot in #3082
New Contributors
- @charliez0 made their first contribution in #3045
- @DecodersLord made their first contribution in #3050
- @linshaoxin-maker made their first contribution in #3076
- @tolik518 made their first contribution in #3135
- @tofarr made their first contribution in #3138
Full Changelog: 0.8.1...0.8.2