This list of Apache Software Foundation projects contains the software development projects of The Apache Software Foundation (ASF).[1]
Besides the projects, there are a few other distinct areas of Apache:
- Incubator: for aspiring ASF projects
- Attic: for retired ASF projects
- INFRA - Apache Infrastructure Team: provides and manages all infrastructure and services for the Apache Software Foundation, and for each project at the Foundation
Incubating projects
- Annotator: provides annotation enabling code for browsers, servers, and humans
- Baremaps: toolkit and a set of infrastructure components for creating, publishing and operating online maps
- Celeborn: intermediate data service for big data computing engines to boost performance, stability and flexibility
- DataLab: platform for creating self-service, exploratory data science environments in the cloud using best-of-breed data science tools
- DevLake: development data platform, providing the data infrastructure for developer teams to analyze and improve their engineering productivity
- HugeGraph: a large-scale and easy-to-use graph database
- KIE: community of solutions and supporting tooling for knowledge engineering and process automation, focusing on events, rules and workflows
- Liminal: an end-to-end platform for data engineers and scientists, allowing them to build, train and deploy machine learning models in a robust and agile way
- Livy: web service that exposes a REST interface for managing long-running Spark contexts
- Milagro: core security infrastructure for decentralized networks
- Nemo: data processing system
- NLPCraft: Java API for NLU applications
- OpenDAL: Open Data Access Layer. Offers native layer support, enabling users to implement middleware or intercept for all operations
- Paimon: unified lake storage to build dynamic tables for both stream and batch processing with big data compute engines, supporting high-speed data ingestion and real-time data query
- Pegasus: distributed key-value storage system which is designed to be simple, horizontally scalable, strongly consistent and high-performance
- Pekko: toolkit and an ecosystem for building highly concurrent, distributed, reactive and resilient applications for Java and Scala
- Pony Mail: mail-archiving, archive viewing, and interaction service
- StreamPark: a streaming application development platform
- Teaclave: universal secure computing platform
- Toree: provides applications with a mechanism to interactively and remotely access Spark
- Training: project aims to develop resources which can be used for training purposes in various media formats, languages and for various Apache and non-Apache target projects
- Tuweni: set of libraries and other tools to aid development of blockchain and other decentralized software in Java and other JVM languages
- Uniffle: an unified Remote Shuffle Service
- Wayang: cross-platform data processing system
The above may be incomplete, as the list of incubating project changes frequently.
Retired projects
A retired project is one which has been closed down on the initiative of the board, the project its PMC, the PPMC or the IPMC for various reasons. It is no longer developed at the Apache Software Foundation and does not have any other duties.
- Abdera: implementation of the Atom Syndication Format and Atom Publishing Protocol
- ACE: a distribution framework that allows central management and distribution of software components, configuration data and other artefacts to target systems
- Any23: Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents
- Apex: Enterprise-grade unified stream and batch processing engine
- Aurora: Mesos framework for long-running services and cron jobs
- AxKit: XML Application Server for Apache. It provided on-the-fly conversion from XML to any format, such as HTML, WAP or text using either W3C standard techniques, or flexible custom code
- Beehive: Java visual object model
- Buildr: a build system for Java-based applications, including support for Scala, Groovy and a growing number of JVM languages and tools
- Chemistry: provides open source implementations of the Content Management Interoperability Services (CMIS) specification
- Chukwa: Chukwa is an open source data collection system for monitoring large distributed systems
- Clerezza: a service platform which provides a set of functionality for management of semantically linked data accessible through RESTful Web Services and in a secured way
- Click: simple and easy-to-use Java Web Framework
- Continuum: continuous integration server
- Crimson: Java XML parser which supports XML 1.0 via various APIs
- Crunch: Provides a framework for writing, testing, and running MapReduce pipelines
- Deltacloud: provides common front-end APIs to abstract differences between cloud providers
- DeviceMap: device Data Repository and classification API
- DirectMemory: off-heap cache for the Java Virtual Machine
- DRAT: large scale code license analysis, auditing and reporting
- Eagle: open source analytics solution for identifying security and performance issues instantly on big data platforms
- ECS: API for generating elements for various markup languages
- ESME: secure and highly scalable microsharing and micromessaging platform that allows people to discover and meet one another and get controlled access to other sources of information, all in a business process context
- Etch: cross-platform, language- and transport-independent RPC-like messaging framework
- Excalibur: Java inversion of control framework including containers and components
- Falcon: data governance engine
- Forrest: documentation framework based upon Cocoon
- Giraph: scalable Graph Processing System
- Hama: Hama is an efficient and scalable general-purpose BSP computing engine
- Harmony: Java SE 5 and 6 runtime and development kit
- HiveMind: services and configuration microkernel
- iBATIS: Persistence framework which enables mapping SQL queries to POJOs
- Jakarta: server side Java, including its own set of subprojects
- Jakarta Cactus: simple test framework for unit testing server-side Java code
- Joshua: statistical machine translation toolkit
- Apache jUDDI Committee
- Scout: Apache Scout is an implementation of the JSR 93 (JAXR).
- Labs: a place for innovation where committees of the foundation can experiment with new ideas
- Lens: Unified Analytics Interface
- Lenya: content management system (CMS) based on Apache Cocoon
- Lucy: search engine library that provides full-text search for dynamic programming languages
- Marmotta: An Open Platform for Linked Data
- MetaModel: provides a common interface for discovery, exploration of metadata and querying of different types of data sources.
- Metron: Real-time big data security
- MRUnit: Java library that helps developers unit test Apache Hadoop map reduce jobs
- MXNet: Deep learning programming framework
- ODE: Apache ODE is a WS-BPEL implementation that supports web services orchestration using flexible process definitions.
- ObJectRelationalBridge (OJB): Object/Relational mapping tool that allowed transparent persistence for Java Objects against relational databases
- Oltu - Parent: OAuth protocol implementation in Java
- Onami: project focused on the development and maintenance of a set of Google Guice extensions not provided out of the box by the library itself
- OODT: Object Oriented Data Technology, a data management framework for capturing and sharing data
- Open Climate Workbench: A comprehensive suite of algorithms, libraries, and interfaces designed to standardize and streamline the process of interacting with large quantities of observational data and conducting regional climate model evaluations
- ORO: Regular Expression engine supporting various dialects
- Polygene: community based effort exploring Composite Oriented Programming for domain centric application development
- PredictionIO: PredictionIO is an open source Machine Learning Server built on top of state-of-the-art open source stack, that enables developers to manage and deploy production-ready predictive services for various kinds of machine learning tasks.
- REEF: A scale-out computing fabric that eases the development of Big Data applications on top of resource managers such as Apache YARN and Mesos
- Regexp: Regular Expression engine
- River: provides a standards-compliant JINI service
- Sentry: Fine grained authorization to data and metadata in Apache Hadoop
- Shale: web application framework based on JavaServer Faces
- Shindig: OpenSocial container; helps start hosting OpenSocial apps quickly by providing the code to render gadgets, proxy requests, and handle REST and RPC requests
- Sqoop: a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases
- STDCXX: collection of algorithms, containers, iterators, and other fundamental components of every piece of software, implemented as C++ classes, templates, and functions essential for writing C++ programs
- Stanbol: Software components for semantic content management
- Stratos: Platform-as-a-Service (PaaS) framework
- Tajo: relational data warehousing system. It using the hadoop file system as distributed storage.
- Tiles: templating framework built to simplify the development of web application user interfaces.
- Trafodion: Webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop[10][11][12]
- Tuscany: SCA implementation, also providing other SOA implementations
- Twill: Use Apache Hadoop YARN's distributed capabilities with a programming model that is similar to running threads
- Usergrid: an open-source Backend-as-a-Service ("BaaS" or "mBaaS") composed of an integrated distributed NoSQL database, application layer and client tier with SDKs for developers looking to rapidly build web and/or mobile applications
- VXQuery: Apache VXQuery implements a parallel XML Query processor.
- Wave: online real-time collaborative editing
- Whirr: set of libraries for running cloud services
- Wink: RESTFul web services based on JAX-RS specification
- Wookie: parser, server and plugins for working with W3C Packaged Web Apps
- WS Muse: implementation of the WS-ResourceFramework (WSRF), WS-BaseNotification (WSN), and WS-DistributedManagement (WSDM) specifications
- Xang: XML Web Framework that aggregated multiple data sources, made that data URL addressable and defined custom methods to access that data
- Xindice: XML Database
- Zipkin: distributed tracing system
- OpenCMIS: ollection of Java libraries, frameworks and tools around the CMIS specification for document interoperability.
The above may be incomplete, as the list of retired projects changes.