github langchain-ai/langchain langchain-text-splitters==1.0.0a1

Initial release

release(text-splitters): 1.0.0a1 (#33214)
refactor(text-splitters): drop python 3.9 (#33212)
release: v1.0.0 (#32567)
docs: more standardization (#33124)
chore: bump ruff version to 0.13 (#33043)
test(text-splitters): capture beta warnings (#33113)
test(text-splitters): resolve pytest marker warning (#33112)
chore: update pyproject.toml files, remove codespell (#33028)
chore: bump mypy version to 1.18 (#32914)
fix(text-splitters): add validation to prevent infinite loop and prevent empty token splitter (#32205)
revert: "chore: remove ruff target-version" (#32895)
chore: remove ruff target-version (#32880)
chore(docs): update package READMEs (#32869)
chore(text-splitters): enable ruff docstring-code-format (#32854)
chore(text-splitters): select ALL rules with exclusions (#32325)
release(text-splitters): 0.3.11 (#32770)
docs(text-splitters): fix some docstrings (#32767)
chore(text-splitters): bump version of core (#32740)
chore(text-splitters): relax max bound for langchain-core (#32739)
chore: address pytest-asyncio deprecation warnings + other nits (#32696)
docs(docs): fixed typos in documentation (#32661)
chore(text-splitters): add mypy pydantic plugin (#32611)
feat(text-splitters): add optional custom header pattern support (#31887)
chore(text-splitters): add mypy warn_unreachable (#32558)
docs: update outdated README.md content (#32540)
chore(text-splitters): bump mypy version to 1.17 (#32387)
feat(text-splitters): ruff fixes and rules (#32502)
chore: formatting across codebase (#32466)
fix: formatting issues in docstrings (#32265)
feat(docs): improve devx, fix Makefile targets (#32237)
fix(text-splitters): update langchain-core version to 0.3.72
fix(text-splitters): update lock for release
fix(langchain): class HTMLSemanticPreservingSplitter ignores the text inside the div tag (#32213)
feat(text-splitters): add Visual Basic 6 support (#31173)
text-splitters: Add ruff rules FBT (#31935)
text splitters: add chunk_size and chunk_overlap validations (#31916)
fix: automatically fix issues with ruff (#31897)
text-splitters: Ruff autofixes (#31858)
text-splitters: Bump ruff version to 0.12 (#31866)
text-splitters: bump spacy version to 3.8.7 (#31834)
text-splitters: Add ruff rule UP (pyupgrade) (#31841)
text-splitters: 0.3.9 (#31844)
security: Remove xslt_path and harden XML parsers in HTMLSectionSplitter: package: langchain-text-splitters (#31819)
text-splitters: fix stale header metadata in ExperimentalMarkdownSyntaxTextSplitter (#31622)
chore: Bump langsmith in splitter uv (#31626)
text-splitters: Add keep_separator arg to HTMLSemanticPreservingSplitter (#31588)
text-splitters[patch]: fix some import-untyped errors (#31030)
text-splitters: Fix regex separator merge bug in CharacterTextSplitter (#31137)
packaging: remove Python upper bound for langchain and co libs (#31025)
text-splitters: Set strict mypy rules (#30900)
text-splitters: release 0.3.8 (#30671)
fix huggingface tokenizer default length function (#30185)
text-splitters[patch]: Refactor HTMLHeaderTextSplitter for Enhanced Maintainability and Readability (#29397)
text-splitters: release 0.3.7 (#30347)
text-splitters: Add JSFrameworkTextSplitter for Handling JavaScript Framework Code (#28972)
multiple: fix uv path deps (#29790)
text-splitters[patch]: release 0.3.6 (#29647)
core: release 0.3.34 (#29644)
infra: migrate to uv (#29566)
text-splitters[patch]: delete unused html_chunks_with_headers.xslt (#29340)
text-splitters: Bump ruff version to 0.9 (#29231)
text-splitters[minor]: Replace lxml and XSLT with BeautifulSoup in HTMLHeaderTextSplitter for Improved Large HTML File Processing (#27678)
(text-splitters): Small Fix in _process_html for HTMLSemanticPreservingSplitter to properly extract the metadata. (#29215)
text-splitters[patch]: release 0.3.5 (#29054)
infra: speed up unit tests (#28974)
text_splitters: Add HTMLSemanticPreservingSplitter (#25911)
text-splitters: fix state persistence issue in ExperimentalMarkdownSyntaxTextSplitter (#28373)
text-splitters: release 0.3.4 (#28795)
text-splitters: Inconsistent results with NLTKTextSplitter's add_start_index=True (#27782)
text-splitters: extended-tests without socket (#28736)
text-splitters: test without socket (#28732)
text-splitters[patch]: Release 0.3.3 (#28723)
text-splitters: add pydocstyle linting (#28127)
docs: more api ref links, add linting step to prevent more (#28495)
many: use core 0.3.15 (#27834)
text-splitters[patch]: Release 0.3.1 (#27726)
all: test 3.13 ci (#27197)
Improvement[Community] Improve Document Loaders and Splitters (#27568)
multiple: rely on asyncio_mode auto in tests (#27200)
Improvement[text-splitter] Fix import of ExperimentalMarkdownSyntaxTextSplitter (#26703)
text-splitters: release 0.3 (#26460)
multiple: pydantic 2 compatibility, v0.3 (#26443)
text-splitters[patch]: Release 0.2.4 (#25979)
text-splitters[patch]: Modified SpacyTextSplitter to fully keep whitespace when strip_whitespace is false (#23272)
all: Improve make lint command (#25344)
text-splitters[patch]: fix typing for keep_separator (#25706)
text-splitters[patch]: Release 0.2.3 (#24998)
langchain : text_splitters Added PowerShell (#24582)
all: add release notes to pypi (#24519)
text_splitters: add request parameters for function HTMLHeaderTextSplitter.split_text… (#24178)
docs: Spell check fixes (#24217)
community[minor]: added support for C in RecursiveCharacterTextSplitter (#24091)
infra: update mypy 1.10, ruff 0.5 (#23721)
text-splitters: release 0.2.2 (#23508)
text-splitters: Fix/recursive json splitter data persistence issue (#21529)
text-splitters: Introduce Experimental Markdown Syntax Splitter (#22257)
core[patch]: Pin pydantic in py3.12.4 (#23130)
text-splitters[patch]: Fix HTMLSectionSplitter (#22812)
Community[minor]: Add language parser for Elixir (#22742)
multiple: get rid of pyproject extras (#22581)
text-splitters[patch]: Release 0.2.1 (#22490)
text-splitters[patch]: fix HTMLSectionSplitter parsing of xslt paths (#22176)
fix error message (#22437)
text-splitters[patch]: Extend TextSplitter:keep_separator functionality (#21130)
infra: fix CI on text-splitters (#21935)
text-splitters: release 0.2.0 (#21832)
multiple: core 0.2 nonbreaking dep, check_diff community->langchain dep (#21646)
text-splitters[patch]: fix MarkdownHeaderTextSplitter fails to parse headers with non-printable characters (#20645)
text-splitters[minor]: Add lua code splitting (#20421)
text-splitters[minor]: Adding a new section aware splitter to langchain (#16526)
text-splitters[minor]: Added Haskell support in langchain.text_splitter module (#16191)
infra: Consistent lxml requirements (#19520)
splitters: Add ensure_ascii parameter (#18485)
text-splitters[patch]: fix json split of RecursiveJsonSplitter (#19119)
community, langchain, infra: revert store extended test deps outside of poetry (#19153)
community, langchain, infra: store extended test deps outside of poetry (#18995)
docs: text splitters readme (#18359)
text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346)
