New projects strengthen the open source AI and data ecosystem and expand the Foundation’s technical portfolio
SAN FRANCISCO, Calif., April 29, 2025 – LF AI & Data Foundation, an umbrella foundation of the Linux Foundation supporting open source innovation in artificial intelligence and data, today announced the induction of three new open source projects contributed by IBM: Docling, Data Prep Kit, and BeeAI. All three projects have officially been inducted by the LF AI & Data Technical Advisory Committee.
These contributions significantly enhance LF AI & Data’s technical landscape in three rapidly growing domains—semantic document understanding, enterprise-grade data preparation, and privacy-preserving federated learning—reinforcing the foundation’s mission to build a sustainable and open AI ecosystem.
The New Projects:
“We are excited to welcome Docling, Data Prep Kit, and BeeAI into the LF AI & Data family,” said Todd Moore, SVP, Community Operations at the Linux Foundation and interim Executive Director, LF AI & Data. “These contributions from IBM reflect a strong commitment to open collaboration and responsible AI. I love BeeAI’s commitment to both Javascript and Python for aggregated learning.”
“Docling, Data Prep Kit, and BeeAI were born from a need to fill critical gaps in AI development tooling and accelerate innovation in the Generative AI space. We’re proud to see them as a catalyst enabling the broader open-source community to build AI applications and agentic workflows,” said Brad Topol, Distinguished Engineer and Director of Open Source, IBM. "We’re excited to collaborate with the open-source community to evolve these technologies and solve real-world challenges together."
Governance & Community Collaboration
The projects will benefit from the governance, technical support, and ecosystem engagement that LF AI & Data provides to its hosted projects. All three projects have officially been inducted by the LF AI & Data Technical Advisory Committee (TAC) and will establish neutral, community-driven technical steering committees.
The projects are now publicly available for exploration and contribution. Developers, data scientists, and researchers are encouraged to get involved and shape the future of these impactful technologies.
For more information and to get involved, visit: https://lfaidata.foundation