github ConardLi/easy-dataset 1.5.1
[1.5.1] 2025-10-19

9 days ago

如果遇到 Github 下载慢的问题可以使用网盘下载:https://pan.quark.cn/s/194b7eedf16e

🔧 修复

  1. 删除文件时领域树修订不准确
    → 再次优化文件删除后领域树的更新逻辑,确保仅移除与删除文件强关联的节点,避免误删或残留无效节点,提升领域树结构准确性。

  2. 删除答案后问题状态未更新(#572
    → 修复删除问题生成的答案后,问题管理中仍显示“已生成答案”状态的问题,确保状态与实际数据一致。

  3. 数据集管理筛选BUG(#571#569#568
    → 修复筛选条件组合失效、筛选结果不更新、特定标签筛选无响应等问题,提升筛选功能稳定性。

  4. Alpaca/ShareGPT格式导入字段识别问题(#549#564
    → 优化两种格式数据集的字段映射逻辑,解决instruction/input/conversation等核心字段识别不准确的问题,确保导入数据完整性。

⚡ 优化

  1. 数据集导出支持选中项导出(#570
    → 导出数据集时新增“仅导出选中项”选项,支持手动勾选特定数据集进行导出,提升批量操作灵活性。

  2. 数据集确认与编辑优化(#542

    • 新增“取消确认”功能:确认数据集后可随时撤销确认状态,避免误操作导致的不可逆影响。
    • 数据集详情页支持直接编辑问题内容,无需跳转至单独页面,简化修改流程。

🔧 Fixes

  1. Inaccurate domain tree revision when deleting files
    → Further optimized domain tree update logic after file deletion to ensure only nodes strongly associated with deleted files are removed, avoiding incorrect deletions or residual invalid nodes and improving structural accuracy.

  2. Question status remains "answered" after deleting answers(#572
    → Fixed the issue where questions in the management list still showed "answer generated" status after their answers were deleted, ensuring status consistency with actual data.

  3. Dataset management filtering bugs(#571#569#568
    → Resolved issues such as invalid filter combinations, unupdated results, and unresponsive tag filtering, improving the stability of filtering functions.

  4. Inaccurate field recognition during Alpaca/ShareGPT import(#549#564
    → Optimized field mapping logic for these formats, fixing misrecognition of core fields like instruction/input/conversation to ensure complete data import.

⚡ Optimizations

  1. Support exporting only selected datasets(#570
    → Added an option to "export only selected items" during dataset export, allowing manual selection of specific datasets for export to enhance batch operation flexibility.

  2. Dataset confirmation and editing improvements(#542

    • Added "undo confirmation" function: Allows reverting the confirmed status of datasets to avoid irreversible impacts from misoperations.
    • Enabled direct question editing on the dataset details page, eliminating the need to navigate to a separate page and simplifying modification workflows.

Don't miss a new easy-dataset release

NewReleases is sending notifications on new releases.