Vendor
The Apache Software Foundation
Company Website
- Apache Druid
Apache Druid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale. It supports high concurrency, low latency ingestion, and is optimized for OLAP-style queries on large, high-dimensional datasets.
- Apache Batik
Apache Batik is a Java-based toolkit for applications or applets that need to handle Scalable Vector Graphics (SVG). It enables SVG display, generation, and manipulation, offering modular components like parsers, generators, DOM implementations, and transcoders for converting SVG to raster or other vector formats.
- Apache Iggy
Apache Iggy is a high-performance, persistent message streaming platform written in Rust. It processes millions of messages per second with ultra-low latency, supports multiple transport protocols, and offers multi-language SDKs, making it ideal for modern distributed systems and real-time data pipelines.
- Apache Uniffle
Apache Uniffle is a remote shuffle service designed to optimize data shuffle operations in distributed computing frameworks like Apache Spark and Hadoop. It improves performance, scalability, and fault tolerance while supporting cloud-native deployments and multiple storage backends.
- Apache Texen
Apache Texen is a general-purpose text generation utility based on the Velocity engine. It operates as an Ant task, using control and worker templates along with a context to produce structured text output, making it ideal for generating code, documentation, or configuration files programmatically.
- Apache OpenOffice
Apache OpenOffice is a free and open-source office productivity suite that includes applications for word processing, spreadsheets, presentations, graphics, databases, and more. It supports open standards and runs on multiple platforms, offering a reliable alternative to commercial office software.
- Apache Turbine
Apache Turbine is a servlet-based web framework for building modular, scalable Java web applications. It supports MVC architecture, service-oriented design, and integrates with various presentation and backend technologies to streamline development and deployment.
- Apache Geode
Apache Geode is a distributed, in-memory data management platform that delivers low-latency, high-concurrency access to data. It supports real-time applications with features like data replication, partitioning, event notifications, and ACID transactions across scalable cloud architectures.
- Apache DolphinScheduler
Apache DolphinScheduler is a distributed and extensible open-source workflow orchestration platform. It enables users to design, schedule, and monitor complex data workflows using a visual DAG interface. It supports high concurrency, low latency, and integrates with diverse data processing tools and environments.
- Apache Mahout
Apache Mahout is a machine learning framework focused on scalable algorithms for clustering, classification, and recommendation. It provides a distributed linear algebra engine and a Scala DSL, enabling efficient development of custom machine learning solutions for large-scale data processing.
- Apache Apex
Apache Apex is a native YARN big data-in-motion platform that unifies stream and batch processing. It enables scalable, fault-tolerant, and high-performance data processing on Hadoop, supporting real-time analytics and operational simplicity for enterprise-grade applications.
- Apache Log4php
Apache Log4php is a flexible and configurable logging framework for PHP applications, enabling developers to log messages to various destinations using different formats. It supports multiple configuration styles and is designed to improve debugging, monitoring, and application maintainability.
- Apache Gobblin
Apache Gobblin is a distributed data integration framework for ingesting, transforming, and managing large volumes of data from diverse sources in both streaming and batch environments. It supports scalability, fault tolerance, and metadata management across heterogeneous ecosystems.
- Apache NetBeans
Apache NetBeans is an open-source development environment, tooling platform, and application framework. It supports multiple languages and platforms, offering smart code editing, debugging, and refactoring tools for building modern desktop, web, and mobile applications.
- Apache Gump
Apache Gump is an open-source continuous integration system that builds and tests open-source Java projects against the latest versions of their dependencies. It helps detect compatibility issues early by compiling and testing projects from source in a coordinated and automated environment.
- Apache Lucene
Apache Lucene is a high-performance Java library for full-text search, indexing, and querying. It supports structured and unstructured data, faceting, spell correction, and vector-based nearest-neighbor search. Lucene is open source and highly scalable.
- Apache Arrow
Apache Arrow is a universal columnar memory format and multi-language development platform for high-performance data interchange and in-memory analytics. It enables efficient processing of flat and nested data structures across modern hardware and programming languages, supporting zero-copy reads and standardized data representation.
- Apache StormCrawler
Apache StormCrawler is an open-source SDK for building scalable, low-latency web crawlers using Apache Storm. It provides reusable components for stream-based and recursive crawling, making it suitable for real-time and large-scale web data extraction.
- Apache Lucene.NET
Lucene.NET is a high-performance, open-source search engine library written in C# as a line-by-line port of Apache Lucene. It enables full-text search capabilities for .NET applications across platforms, offering fast indexing and retrieval of large volumes of data.
- Apache NuttX
Apache NuttX is a real-time embedded operating system focused on standards compliance and a small footprint. It supports scalable development from 8-bit to 64-bit microcontrollers, offering rich POSIX interfaces, real-time scheduling, and modular configuration for deeply embedded systems.
- Apache Sling
Apache Sling is a framework for RESTful web applications based on a flexible content tree. It maps HTTP request URLs to content resources using path, extension, and selectors, enabling dynamic request processing through scripts and servlets. Its modular design supports specialized server instances and diverse applications.
- Apache OpenDAL
Apache OpenDAL is an open-source data access layer that provides a unified interface to interact with various storage backends, including cloud, local, and remote systems. It simplifies data operations by abstracting backend-specific logic and supports multiple programming languages.
- Apache Accumulo
Apache Accumulo is a distributed key/value store that provides scalable and secure data storage. It supports fine-grained access control, efficient data compression, and server-side processing, making it suitable for large-scale applications requiring high performance and flexible data management.
- Apache Serf
Apache Serf is a high-performance C-based HTTP client library using the Apache Portable Runtime. It supports asynchronous connections, SSL/TLS, HTTP pipelining, and multiple authentication modes. Serf is the default client library for Apache Subversion.
- Apache ServiceMix
Apache ServiceMix is a flexible, open-source integration container that unifies the features of Apache ActiveMQ, Camel, CXF, and Karaf into a powerful runtime platform for building integration solutions. It provides a complete, enterprise-ready ESB powered by OSGi.
- Apache BVal
Apache BVal provides an implementation of the Jakarta Bean Validation 3.0 Specification. It is TCK compliant, compatible with Java SE 11 or later, and supports validation of JavaBeans through annotations and constraints, enabling robust and standardized data validation in enterprise applications.
- Apache Fluss
Apache Fluss is an incubating project focused on building a distributed stream processing framework. It aims to provide scalable, fault-tolerant, and high-throughput data processing capabilities for real-time applications using modern cloud-native architecture principles.
- Apache Answer
Apache Answer is an open-source Q&A platform designed for teams and communities to share knowledge. It supports forums, help centers, and internal knowledge bases, enabling users to ask questions, provide answers, and collaborate efficiently in a structured and searchable environment.
- Apache Fory
Apache Fory is a blazing-fast cross-language serialization framework that uses just-in-time compilation and zero-copy techniques to achieve up to 170x performance gains over traditional methods. It supports multiple programming languages and simplifies data exchange between systems with intuitive usage and high flexibility.
- Apache Grails
Apache Grails is a powerful Groovy-based web application framework built on top of Spring Boot. It simplifies development through convention-over-configuration, seamless Java integration, and support for modern web technologies, enabling rapid creation of scalable and maintainable enterprise applications.
- Apache Tapestry
Apache Tapestry is a component-based web framework for Java that simplifies building scalable, maintainable web applications. It supports live class reloading, strong typing, and convention-over-configuration to accelerate development while maintaining performance and flexibility.
- Apache Impala
Apache Impala is a modern, open-source, distributed SQL query engine optimized for low-latency analytics on big data stored in Hadoop, HDFS, HBase, S3, and Iceberg. It supports standard SQL, scales efficiently, and integrates with Hive metadata and security.
- Apache Solr
Apache Solr is an open-source search platform built on Apache Lucene. It provides powerful full-text, faceted, and geospatial search capabilities, supports distributed indexing and querying, and is designed for scalability, fault tolerance, and extensibility in enterprise search applications.
- Apache HugeGraph
Apache HugeGraph is an open-source, general-purpose graph database system that supports efficient storage, fast querying, and large-scale graph computing. It is compatible with Apache TinkerPop3 and Gremlin, enabling deep relationship analysis, fraud detection, and knowledge graph applications across distributed environments.
- Apache Derby
Apache Derby is a lightweight, open source relational database written in Java. It supports embedded and client/server modes, adheres to JDBC and SQL standards, and is easy to deploy in Java-based applications.
- Apache Bigtop
Apache Bigtop is a project for infrastructure engineers and data scientists focused on packaging, testing, and configuring leading open-source big data components like Hadoop, HBase, and Spark. It supports multiple platforms and provides tools for automated deployment, validation, and maintenance of complete big data stacks.
- Apache Pony Mail
Apache Pony Mail is a web-based mail archive browser that enables fast searching, browsing, and replying to mailing list threads. It supports public and private lists, OAuth2 authentication, and scalable performance, making it ideal for managing large volumes of archived email communications.
- Apache Toree
Apache Toree is a Jupyter kernel that enables interactive access to Apache Spark. It supports multiple languages including Scala and Python, allowing users to run Spark code in notebooks for data analysis, visualization, and distributed computing workflows.
- Apache Groovy
Apache Groovy is a dynamic language for the Java platform that enhances productivity through concise syntax and powerful features. It integrates seamlessly with Java, supports scripting and domain-specific languages, and is widely used for application development, testing, and automation.
- Apache Velocity
Apache Velocity is a Java-based template engine that allows developers to reference Java objects using a simple template language. It separates code from presentation, enabling maintainable web development and supporting generation of content like HTML, XML, SQL, and reports.
- Apache Causeway
Apache Causeway is an open-source framework for rapidly developing domain-driven applications in Java. It automatically generates user interfaces and APIs from domain models, allowing developers to focus on business logic while enabling seamless integration and clean architecture.
- Apache Daffodil
Apache Daffodil is an open-source implementation of the Data Format Description Language (DFDL), enabling conversion between fixed-format data and XML or JSON for seamless data integration and processing across diverse systems.
- Apache TinkerPop
Apache TinkerPop is an open-source graph computing framework that supports both online transaction processing (OLTP) with graph databases and online analytical processing (OLAP) with graph analytics systems. It enables complex graph traversals using its powerful query language, Gremlin, and is widely supported across commercial and open-source graph platforms.
- Apache SINGA
Apache SINGA is a distributed deep learning library designed for scalable training across multiple devices. It supports various neural network architectures, provides a flexible programming interface, and integrates with ONNX for model interoperability, making it suitable for research and enterprise AI applications.
- Apache Kylin
Apache Kylin is an open-source OLAP engine for big data that delivers sub-second query latency on trillions of records, enabling high-performance, high-concurrency analytics with seamless BI tool integration.
- Apache FreeMarker
Apache FreeMarker is a Java-based template engine that generates text output from templates and data models, supporting formats like HTML, XML, and JSON. It separates presentation from logic, enabling clean, maintainable code and flexible integration in web and non-web applications.
- Apache PLC4X
Apache PLC4X is a universal protocol adapter for Industrial IoT that enables communication with industrial programmable logic controllers using various protocols through a shared API. It allows integration of shopfloor assets without retrofitting, offering high performance, security, and open-source flexibility.
- Apache XMLBeans
Apache XMLBeans is a Java library that provides XML binding capabilities, allowing developers to access and manipulate XML data using generated Java classes based on XML Schema definitions.
- Apache HoraeDB
Apache HoraeDB is a high-performance, distributed, cloud-native time-series database. It is designed for scalability, cost-efficiency, and flexibility, supporting real-time data ingestion, querying, and analytics across large datasets in modern infrastructure environments.
- Apache BifroMQ
Apache BifroMQ is a high-performance, distributed MQTT broker designed for large-scale IoT messaging systems. It supports full MQTT protocol versions 3.1 to 5.0 and includes native multi-tenancy, built-in storage, and extensibility for authentication, monitoring, and event handling.
- Apache PDFBox
Apache PDFBox is an open-source Java library for working with PDF documents. It allows developers to create new PDFs, manipulate existing ones, and extract content such as text and metadata. PDFBox also includes command-line tools for document processing and supports rendering and font management.
- Apache Seata
Apache Seata is a distributed transaction solution that ensures data consistency across microservices. It supports multiple transaction models and integrates with various databases and RPC frameworks, enabling reliable, scalable, and high-performance transaction management in cloud-native environments.
- Apache Tentacles
Apache Tentacles is a tool for inspecting staged software release artifacts. It downloads and unpacks archives recursively, generating reports that help verify the presence of license and notice files, ensuring compliance and completeness before official release.
- Apache OpenServerless
Apache OpenServerless is a cloud-agnostic serverless platform designed for Kubernetes and Linux environments. It simplifies deployment of AI applications by eliminating container orchestration complexity, offering a production-ready, vendor-neutral solution for building and running serverless workloads anywhere.
- Apache Geronimo
Apache Geronimo is an open-source platform providing Java EE and Jakarta EE libraries along with MicroProfile implementations. It enables modular server runtimes using Java and OSGi technologies, supporting enterprise development with reusable components and native image generation via GraalVM.
- Apache SDAP
Apache SDAP is a cloud-optimized platform for scalable Earth science data analytics. It supports satellite and in situ data integration, anomaly detection, fast subsetting, and machine learning-driven discovery, enabling researchers to analyze environmental trends and harmonize data across distributed infrastructures.
- Apache Shiro
Apache Shiro is a Java security framework that simplifies authentication, authorization, cryptography, and session management. It provides a clean API and flexible architecture for securing applications in various environments, from web to enterprise systems.
- Apache Community Development
Apache Community Development is a project that supports open-source communities by providing tools, guidance, and mentoring. It helps contributors and projects grow sustainably, promotes diversity and inclusion, and fosters collaboration across the Apache Software Foundation ecosystem.
- Apache Lucy
Apache Lucy is a high-performance, full-text search engine library for dynamic programming languages. It provides scalable indexing and retrieval, Unicode support, and modular architecture, making it suitable for building fast and flexible search applications in environments like Perl and C.
- Apache Archiva
Apache Archiva is an extensible repository management software that helps manage personal or enterprise-wide build artifact repositories. It is the perfect companion for build tools such as Maven, Continuum, and ANT. Archiva offers capabilities like remote repository proxying, security access management, artifact storage, delivery, browsing, indexing, usage reporting, and extensible scanning functionality.
- Apache JSPWiki
Apache JSPWiki is a feature-rich, open-source Wiki engine built on Java EE technologies. It supports multiple wiki syntaxes, file attachments, templates, and flexible data storage options. Designed for extensibility and ease of use, it enables collaborative content creation with fine-grained access control and plugin support.
- Apache Dubbo
Apache Dubbo is a cloud-native microservice framework offering high-performance RPC, service discovery, traffic management, observability, and security. It supports multiple languages and protocols, enabling scalable, extensible, and production-ready deployments across diverse environments.
- bRPC
bRPC is an industrial-grade RPC framework in C++ for building reliable, high-performance services across multiple protocols, supporting synchronous and asynchronous communication, profiling, and extensibility for distributed systems.
- Apache Synapse
Apache Synapse is a lightweight, high-performance enterprise service bus that enables the mediation, transformation, and routing of messages between services. It supports multiple protocols and message formats, making it ideal for integrating heterogeneous systems in scalable and flexible service-oriented architectures.
- Apache TVM
Apache TVM is an open-source machine learning compiler framework that enables efficient deployment of ML models on diverse hardware backends including CPUs, GPUs, and accelerators. It optimizes computations and supports multiple frontends like TensorFlow, PyTorch, and Keras.
- Apache TsFile
Apache TsFile is a columnar storage file format optimized for time series data. It supports efficient compression, high read/write throughput, and seamless integration with big data frameworks like Spark and Flink, making it ideal for IoT and time series database applications.
- Apache Traffic Server
Apache Traffic Server is a fast, scalable caching proxy server that handles HTTP and HTTPS traffic. It improves web performance by caching content, supports reverse and forward proxy configurations, and is used by large-scale infrastructures to reduce latency and bandwidth usage.
- Apache Directory
Apache Directory is a suite of open-source directory solutions written in Java. It includes an LDAPv3-compliant directory server, Kerberos support, and tools for managing identity and access. Designed for extensibility and integration, it simplifies secure authentication and directory service deployment across platforms.
- Apache Ambari
Apache Ambari is an open-source management platform for provisioning, monitoring, and managing Apache Hadoop clusters. It provides a web-based UI and RESTful APIs to simplify cluster setup, service configuration, and health monitoring across distributed environments.
- Apache Fluo
Apache Fluo is a distributed system for incrementally processing large-scale data. It enables real-time updates by executing cross-node transactions triggered by data changes, allowing continuous integration without full dataset reprocessing. Built on Apache Accumulo, it supports reactive workflows and scalable data consistency.
- Apache Axis
Apache Axis is a SOAP-based web services framework that enables developers to create, deploy, and consume web services using Java or C++. It supports modular extensions for security, reliability, and eventing, making it suitable for scalable and interoperable service-oriented architectures.
- Apache Roller
Apache Roller is an open-source, Java-based blog server that supports multi-user and group blogging. It is designed for scalability and flexibility, making it suitable for both small and large blog sites.
- Apache SystemDS
Apache SystemDS is an open-source machine learning system for end-to-end data science workflows. It supports scalable training, data cleaning, feature engineering, and deployment using high-level scripting and hybrid execution across local and distributed environments, optimized for performance and flexibility.
- Apache ResilientDB
Apache ResilientDB is a high-throughput distributed ledger designed with scale-centric principles to support decentralized computation. It reimagines blockchain architecture using parallelism and deep pipelining to optimize performance across modern hardware and global cloud infrastructure.
- Apache Training
Apache Training is an open-source project that provides reusable educational materials for Apache technologies. It offers slides, labs, and exercises to support consistent, high-quality training across communities, helping instructors and learners engage with Apache software more effectively.
- Apache Nutch
Apache Nutch is an extensible and scalable open-source web crawler for data mining and indexing. It supports batch processing with Hadoop, customizable plugins, and integration with search platforms, making it ideal for large-scale web content collection and analysis.
- Apache Calcite
Apache Calcite is a dynamic data management framework that provides SQL parsing, query optimization, and connectivity to diverse data sources. It does not store data itself but enables powerful query capabilities across multiple formats and systems.
- Apache Log4net
Apache Log4net is a logging framework for .NET applications that enables developers to log messages to various outputs with configurable formats and levels. It supports hierarchical logging, dynamic configuration, and multiple appenders, making it suitable for both simple and complex enterprise environments.
- Apache Logging Services
Apache Logging Services provides reliable, flexible, and extensible logging frameworks for various programming languages, including Java, .NET, and C++. It enables developers to capture, manage, and analyze application logs efficiently, supporting multiple output formats, configurations, and performance-optimized logging strategies.
- Apache Solr Operator
Apache Solr Operator is a Kubernetes-native management tool designed to deploy, scale, and maintain Apache Solr clusters. It automates Solr Cloud orchestration, enabling reliable, scalable, and fault-tolerant search infrastructure with minimal manual intervention.
- Apache Hadoop
Apache Hadoop is an open-source framework for distributed storage and processing of large data sets using clusters of commodity hardware. It provides scalability, fault tolerance, and high-throughput access through modules like HDFS, YARN, and MapReduce.
- Apache Tez
Apache Tez is a data processing framework built on Hadoop YARN that executes complex directed-acyclic-graphs of tasks. It replaces multiple MapReduce jobs with a single Tez job, improving performance, resource efficiency, and flexibility for big data applications like Hive and Pig.
- Apache AntUnit
Apache AntUnit is an Antlib providing a test framework for Apache Ant tasks and types. It replaces JUnit with Ant-based assertions, enabling build file-driven testing. AntUnit simplifies test creation, supports setup and teardown targets, and includes predefined assertion tasks for validating build behavior.
- Apache Tcl
Apache Tcl integrates the Tcl scripting language with the Apache HTTP server to create dynamic web applications. It enhances Tcl’s flexibility and robustness by embedding it directly into the server, enabling efficient execution of Tcl scripts and templates for web content generation.
- Apache Commons
Apache Commons is an Apache project focused on reusable Java components. It consists of Commons Proper for stable libraries, Commons Sandbox for experimental development, and Commons Dormant for inactive components. Its goal is to simplify Java development by providing well-tested, modular, and low-dependency utilities.
- Apache OpenMeetings
Apache OpenMeetings is an open-source web conferencing platform that enables video meetings, screen sharing, collaborative whiteboarding, document editing, and messaging. It supports user and room management, calendar integration, and session recording, and is accessible via browser without additional software installation.
- Apache CloudStack
Apache CloudStack is open-source software for deploying and managing large networks of virtual machines. It provides a scalable, highly available Infrastructure-as-a-Service (IaaS) platform used for public, private, and hybrid cloud environments.
- Apache Ivy
Apache Ivy is a flexible and powerful dependency manager for Java projects, integrated with Apache Ant. It supports transitive dependencies, conflict resolution, and customizable repository configurations. Ivy simplifies builds, enhances performance, and is open source under the Apache License.
- Apache Cloudberry
Apache Cloudberry is an open-source MPP database built on PostgreSQL 14, designed for data warehousing, large-scale analytics, and AI/ML workloads, offering advanced enterprise capabilities and compatibility with Greenplum.
- Apache Baremaps
Apache Baremaps is a toolkit and infrastructure platform for creating, publishing, and operating online maps. It enables custom vector tile generation from PostGIS and supports formats like OsmPbf, ShapeFile, and GeoPackage, offering real-time styling, geocoding, and extensible mapping workflows.
- Apache MRUnit
Apache MRUnit is a Java-based unit testing framework for Hadoop MapReduce applications. It allows developers to test Mapper and Reducer logic independently without deploying to a cluster, improving code quality and simplifying debugging in big data processing workflows.
- Apache Whimsy
Apache Whimsy is a set of tools developed to support the internal operations of the Apache Software Foundation. It visualizes organizational data, automates administrative tasks, and enables rapid deployment of prototypes for ASF processes, primarily used by committers and members with access to internal systems.
- Apache Chainsaw
Apache Chainsaw is a GUI-based log viewer for Log4j, designed to simplify the analysis of logging events from local or remote sources, supporting various formats and protocols including XMLLayout, TCP, UDP, and java.util.logging.
- Apache Petri
Apache Petri mentors external project communities to help them understand Apache Software Foundation principles and build sustainable, meritocratic governance. It guides them toward becoming official Apache projects through education and community development.
- Apache Buildr
Apache Buildr is a build system for Java-based applications with support for Scala, Groovy, and other JVM languages. It emphasizes simplicity, speed, and extensibility, using Ruby for scripting. Buildr automates compiling, testing, packaging, and dependency management with minimal configuration.
- Apache OzHera
Apache OzHera is an application performance monitoring platform for cloud-native environments. It integrates metrics, tracing, logging, and alerting to detect and resolve issues quickly. With AI-powered diagnostics and low-cost storage, it enhances system stability and observability while minimizing business system intrusion.
- Apache Royale
Apache Royale is a productive open-source frontend technology for building expressive applications using MXML and ActionScript 3.0, delivering consistent output across browsers, desktops, and mobile devices. It supports enterprise-level development, fast performance, and modular architecture for scalable cross-platform solutions.
- Apache Pegasus
Apache Pegasus is a distributed key-value storage system designed for high performance, strong consistency, and horizontal scalability. It bridges the gap between in-memory systems like Redis and persistent stores like HBase, offering reliable data replication, automatic load balancing, and efficient cross-datacenter data migration.
- Apache Kibble
Apache Kibble is an open-source platform for analyzing and visualizing activity in software projects, providing insights into development trends, contributor engagement, and project health through data collected from repositories, issue trackers, and communication channels.
- Apache ECharts
Apache ECharts is a powerful, open-source JavaScript visualization library for creating interactive, customizable charts. It supports over 20 chart types, real-time rendering of large datasets, responsive design, and accessibility features, making it ideal for data-rich web applications across devices and platforms.
- Apache SkyWalking
Apache SkyWalking is an open-source observability and application performance monitoring (APM) platform designed for distributed systems, microservices, and cloud-native architectures. It collects, analyzes, and visualizes telemetry data including metrics, logs, and traces to provide deep insights into system behavior and performance.
- Apache Spatial Information System (SIS)
Apache SIS is a Java library for developing geospatial applications. It supports metadata, coordinate transformations, and raster data processing, enabling accurate and standards-compliant spatial analysis for desktop and server environments.
- Apache Hop
Apache Hop is an open-source data orchestration platform that enables visual design and execution of data pipelines across multiple environments and runtimes. It supports metadata-driven development, making data integration tasks efficient, reusable, and scalable for modern data engineering workflows.
- Apache Attic
Apache Attic is a project of the Apache Software Foundation that provides a formal process for retiring Apache projects that are no longer actively maintained. It ensures transparency and continuity by preserving project assets while ceasing active development, allowing users to access legacy code and documentation.
- Apache Rya
Apache Rya is a scalable cloud-based RDF triple store supporting SPARQL queries, built on Apache Accumulo and MongoDB. It enables efficient storage, retrieval, and analysis of billions of RDF triples using advanced indexing and query techniques.
- Apache Flink
Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. It runs in all common cluster environments, performs computations at in-memory speed, and scales efficiently for real-time analytics and data-driven applications.
- Apache Fortress
Apache Fortress is a Java-based access management system that implements role-based access control standards. It integrates with LDAP directories to manage authentication, authorization, and administrative policies for secure and scalable identity governance.
- Apache Axiom
Apache Axiom is a high-performance XML object model that supports on-demand building and streaming access using the StAX API. It enables efficient XML processing with support for binary data via XOP and MTOM, and serves as the core of Apache Axis2 while remaining usable independently.
- Apache Rat
Apache Rat is a license auditing tool that scans project files to ensure proper license headers are present. It helps maintain compliance with open-source licensing policies and is commonly used in release processes to verify legal readiness of software distributions.
- Apache Griffin
Apache Griffin is an open-source data quality solution for big data systems. It supports both batch and streaming modes, allowing users to define, measure, and monitor data quality metrics across diverse sources using a flexible rule-based framework integrated with modern data platforms.
- Apache Gluten
Apache Gluten is a native acceleration layer for Apache Spark SQL. It offloads query execution to high-performance engines like Velox, improving speed and efficiency while maintaining compatibility with Spark’s APIs and distributed architecture.
- Apache Atlas
Apache Atlas is a metadata management and data governance platform for managing, discovering, and classifying data assets. It provides data lineage, search, and policy enforcement capabilities, helping organizations maintain compliance, improve data quality, and enable collaboration across data teams.
- Apache Beam
Apache Beam is an open-source unified programming model for batch and streaming data processing. It enables users to define data workflows using language-specific SDKs and execute them across diverse execution engines, supporting scalable, portable, and extensible data integration and transformation pipelines.
- Apache cTAKES
Apache cTAKES is an open-source natural language processing system for extracting information from electronic medical records. It identifies clinical concepts like diseases, medications, and procedures using machine learning and linguistic analysis tailored for healthcare data.
- Apache Hive
Apache Hive is a distributed data warehouse system built on Apache Hadoop. It enables reading, writing, and managing large datasets stored in distributed systems using SQL. Hive supports data warehousing tasks like ETL, reporting, and analysis, and integrates with various storage formats and engines.
- Apache Compress Antlib
Apache Compress Antlib is an Apache Ant extension that adds support for compressing and archiving files using formats like zip, tar, gzip, bzip2, xz, and more, enhancing build automation with flexible and modern packaging capabilities.
- Apache Clerezza
Apache Clerezza is a set of Java libraries for managing semantically linked data using RDF triples. It provides a technology-agnostic layer to access and modify triple stores, supports multiple named graphs, and includes services for parsing, serializing, and interacting with RDF data.
- Apache Web Services
Apache Web Services is a collection of tools and libraries for building standards-compliant web services. It supports XML, SOAP, WSDL, and WS-Security, enabling secure and interoperable communication between distributed systems using Java-based technologies.
- Apache Olingo
Apache Olingo is an open-source Java library that implements the Open Data Protocol (OData). It enables developers to build OData producers and consumers, supporting both client and server aspects. The library currently supports OData 2.0 and aims to support OData 4.0.
- Apache ZooKeeper
Apache ZooKeeper is a centralized, open-source service that enables reliable distributed coordination by managing configuration, naming, synchronization, and group services for distributed applications. It simplifies complex coordination tasks and ensures consistency, availability, and fault tolerance across systems.
- Apache MINA
Apache MINA is a network application framework that simplifies the development of high-performance and scalable network applications. It provides an event-driven asynchronous API over transports like TCP/IP and UDP/IP using Java NIO, supporting protocols such as FTP, SSH, and XMPP.
- Apache ORC
Apache ORC is a columnar storage format optimized for big data processing. It provides efficient compression, indexing, and fast data access, making it ideal for analytics workloads in Hadoop-based systems. ORC supports complex types and is designed for high performance and scalability.
- Apache Flagon
Apache Flagon is an open-source behavioral logging framework designed for business analytics, usability testing, and user research. It enables lightweight, client-side instrumentation to capture user interactions across web applications, supporting scalable data collection and analysis for enterprise and scientific environments.
- Apache Asterixdb
Apache Asterixdb is a scalable open-source big data management system designed for storing, indexing, and querying large volumes of semi-structured data. It supports flexible data models, declarative queries, and distributed processing, making it suitable for data-intensive applications across research, analytics, and enterprise environments.
- Apache Ant
Apache Ant is a Java-based build automation tool that uses XML to define tasks and workflows. It simplifies compiling, packaging, testing, and deploying applications, offering flexibility and extensibility for various programming environments beyond Java.
- Apache Hudi
Apache Hudi is an open-source data lakehouse platform that enables efficient, incremental data processing with ACID guarantees, time travel, and schema evolution. It supports streaming and batch workloads, offers high-performance indexing, and integrates with cloud-native and open data ecosystems.
- Apache Tika
Apache Tika is a content analysis toolkit that detects and extracts metadata and text from over a thousand file types through a single interface. It is widely used for search engine indexing, content analysis, translation, and digital forensics.
- Apache Wayang
Apache Wayang is a cross-platform data processing system that enables seamless execution across multiple engines. It supports ultra-fast, federated in-situ processing and simplifies application portability with minimal code changes.
- Apache Velocity Anakia
Apache Velocity Anakia transforms XML documents into output formats using Velocity templates and JDOM. It simplifies static site generation and documentation by offering a lightweight alternative to XSLT, ideal for projects needing consistent, template-driven HTML output.
- Apache CouchDB
Apache CouchDB is a NoSQL database that uses JSON for data storage, JavaScript for querying, and HTTP for communication. It supports multi-master replication, offline-first sync, and clustering, making it ideal for distributed systems and mobile applications with high reliability and scalability.
- Apache Struts
Apache Struts is a free, open-source MVC framework for building modern Java web applications. It emphasizes convention over configuration, supports extensibility through plugins, and includes built-in support for REST, AJAX, and JSON for dynamic and scalable web development.
- Apache Unomi
Apache Unomi is an open-source customer data platform built in Java. It manages user profiles, events, and personalization across channels while supporting privacy compliance like GDPR. It integrates with various systems and enables scalable, extensible, and secure customer experience management.
- Apache RocketMQ
Apache RocketMQ is a distributed messaging and streaming platform designed for high performance, scalability, and reliability. It supports real-time messaging, event-driven architecture, and stream processing, making it ideal for cloud-native applications and large-scale data systems.
- Apache Airflow
Apache Airflow is a workflow orchestration platform that enables users to programmatically author, schedule, and monitor data pipelines. It uses Python to define workflows as directed acyclic graphs and supports scalable, distributed execution across various environments.
- Apache OpenWhisk
Apache OpenWhisk is a serverless cloud platform that executes functions in response to events. It supports multiple languages, scales automatically, and integrates with external services, enabling developers to build event-driven applications without managing infrastructure.
- Apache Hamilton
Apache Hamilton is a general-purpose Python framework for building dataflows using regular functions. It automatically constructs a Directed Acyclic Graph (DAG) from function dependencies, enabling execution, visualization, and monitoring. It supports scalable, modular, and testable workflows across diverse environments and integrates with modern data platforms.
- Apache Pulsar
Apache Pulsar is an open-source, cloud-native messaging and streaming platform designed for high-performance, low-latency communication. It supports millions of messages per second, offers multi-tenancy, geo-replication, and tiered storage, and simplifies application architecture with scalable, distributed processing.
- Apache HBase
Apache HBase is a distributed, scalable, non-relational database built on top of Hadoop and HDFS. It provides real-time read/write access to large datasets and is designed to host massive tables with billions of rows and millions of columns across commodity hardware.
- Apache TomEE
Apache TomEE is an enterprise-ready application server that extends Apache Tomcat with full Jakarta EE and MicroProfile support. It enables lightweight deployment of Java applications with powerful enterprise features, making it suitable for cloud-native and microservice architectures.
- Apache Samza
Apache Samza is a distributed stream processing framework that enables stateful applications to process real-time data from multiple sources with low latency and high throughput. It supports flexible deployment and integrates with systems like Kafka, HDFS, and cloud services.
- Apache Libcloud
Apache Libcloud is a Python library that provides a unified interface for interacting with multiple cloud service providers. It simplifies cloud management by abstracting provider-specific APIs, enabling consistent access to compute, storage, DNS, and other services across different platforms.
- Apache NiFi
Apache NiFi is a powerful and user-friendly system for automating the movement, transformation, and management of data across systems. It enables secure, scalable, and real-time dataflow for cybersecurity, observability, event streams, and AI pipelines.
- Apache Portable Runtime (APR)
Apache Portable Runtime (APR) is a cross-platform library that provides a consistent interface to system-level functionalities like memory management, file I/O, threading, and networking. It enables developers to write portable applications with predictable behavior across different operating systems, reducing platform-specific coding requirements.
- Apache Ratis
Apache Ratis is a Java-based open-source implementation of the Raft consensus protocol. It enables reliable replication of state across distributed systems, making it ideal for building fault-tolerant applications that require consistency and high availability.
- Apache Beehive
Apache Beehive is a Java framework that simplifies J2EE development using metadata-driven programming. It includes NetUI for web applications, Controls for enterprise resource access, and Web Service Metadata for building annotated web services. Beehive reduces boilerplate code and enhances modularity in enterprise applications.
- Apache Ozone
Apache Ozone is a scalable, distributed object store designed for big data, analytics, and cloud-native applications. It supports billions of objects, S3-compatible APIs, and Hadoop integration, offering strong consistency, flexible durability, and secure, efficient storage for hybrid and large-scale environments.
- Apache Click
Apache Click is a modern Java EE web application framework that provides a rich client-style programming model. It is designed to be easy to learn and use, enabling developers to build web applications quickly with a component and page-oriented architecture, event-based programming, and server-side Ajax support.
- Apache KIE
Apache KIE is an incubating open-source platform for business automation, offering tools for rules, workflows, decision modeling, and optimization through projects like Drools, jBPM, Kogito, and OptaPlanner.
- Apache Chemistry
Apache Chemistry is an open-source implementation of the CMIS standard, enabling interoperability between content management systems through client libraries and server frameworks in multiple programming languages.
- Apache ManifoldCF
Apache ManifoldCF is a framework for connecting content repositories with search engines. It supports crawling, transforming, and indexing data from various sources into systems like Solr or Elasticsearch, enabling secure and scalable enterprise search integration.
- Apache Flex
Apache Flex is an open-source framework for building expressive web, desktop, and mobile applications using MXML and ActionScript. It enables cross-platform development with rich user interfaces and integrates with Adobe AIR for native deployment across devices.
- Apache Rivet
Apache Rivet is a web development tool that integrates the Tcl scripting language with the Apache HTTP Server, enabling dynamic content generation and efficient server-side scripting for building flexible and high-performance web applications.
- Apache MyFaces
Apache MyFaces is an open-source project that provides implementations and extensions for JavaServer Faces. It enables the development of component-based web applications with reusable UI elements, supporting various JSF versions and offering integration with frameworks like CDI and Spring.
- Apache Teaclave
Apache Teaclave is an open-source secure computing platform that enables developers to build memory-safe Trusted Applications using Rust across Intel SGX, Arm TrustZone, and other TEE platforms. It supports confidential computing through modular SDKs, reusable components, and developer-friendly emulation tools.
- Apache Ignite
Apache Ignite is a distributed database for high-performance computing with in-memory speed. It scales across memory and disk, supports SQL, ACID transactions, and real-time processing, making it ideal for accelerating applications, handling hybrid transactional/analytical workloads, and building low-latency data platforms.
- Apache ShenYu
Apache ShenYu is an asynchronous, high-performance, cross-language API gateway designed for traffic governance in microservices and cloud-native environments. It supports multiple protocols and provides dynamic routing, plugin extensibility, and observability features for scalable and responsive service management.
- Apache HiveMind
Apache HiveMind is a lightweight services and configuration microkernel framework for Java. It supports modular application design through dependency injection and dynamic service configuration, enhancing encapsulation, reusability, and testability in enterprise software development.
- Apache VCL
Apache VCL is a cloud computing platform that provides remote access to virtual and physical computing environments. It enables users to reserve and use customized resources through a web interface, supporting education, research, and enterprise needs with flexible scheduling and provisioning.
- Apache Torque
Apache Torque is an object-relational mapper for Java that generates database access classes from an XML schema. It allows developers to interact with relational databases using Java objects, simplifying data manipulation and promoting database independence through code generation.
- Apache FOP
Apache FOP is a Java-based print formatter that converts XSL formatting objects (XSL-FO) into output formats like PDF, PS, PCL, and PNG. It reads XML-based FO trees and renders paginated documents, primarily targeting PDF output for publishing and document generation.
- Apache Empire-db
Apache Empire-db is a lightweight Java framework that provides full SQL freedom for working with relational databases. It avoids traditional ORM limitations by allowing developers to write complex, type-safe SQL queries directly in Java code without relying on annotations, mapping files, or string literals.
- Apache Iceberg
Apache Iceberg is a high-performance open table format for large analytic datasets. It enables reliable SQL-like operations on big data and supports multiple engines like Spark, Flink, Trino, and Hive, allowing concurrent access and advanced features such as schema evolution, hidden partitioning, and time travel.
- Apache Chukwa
Apache Chukwa is an open-source data collection system for monitoring large distributed systems, built on Hadoop’s HDFS and MapReduce, offering scalable log aggregation and analysis capabilities.
- Apache Whisker
Apache Whisker helps maintain accurate legal documentation for assembled software applications. It audits and generates licensing files using metadata, supporting compliance across diverse build environments.
- Apache StreamPipes
Apache StreamPipes is an open-source Industrial IoT toolbox that enables users to connect, analyze, and visualize data streams in real time. It offers a no-code interface for building data pipelines, integrating various protocols, and deploying analytics across industrial environments.
- Apache Axis2/Java Core
Apache Axis2 is a web services engine supporting SOAP, REST, and JSON. It offers high performance, modularity, and extensibility, enabling developers to build and deploy services efficiently. Axis2 supports hot deployment, asynchronous messaging, and integrates with WSDL, making it ideal for scalable service-oriented applications.
- Apache Traffic Control
Apache Traffic Control is an open-source platform for building large-scale content delivery networks. It uses Apache Traffic Server for caching and provides a distributed, scalable, and redundant architecture to efficiently deliver content. It includes modules for routing, monitoring, statistics, and management.
- Apache Qpid
Apache Qpid is a messaging system based on the AMQP protocol that enables reliable, asynchronous communication between distributed applications. It includes brokers, routers, and client libraries for building scalable and secure messaging infrastructures across multiple platforms.
- Apache Johnzon
Apache Johnzon is a Java library that implements the JSON Processing (JSR-353) and JSON Binding (JSON-B) specifications. It provides a flexible and extensible framework for serializing and deserializing Java objects to and from JSON, with support for REST integration, WebSocket communication, and advanced customization features.
- Apache Avro
Apache Avro is a data serialization system that enables efficient, compact, and schema-based data exchange across programming languages, supporting dynamic typing and schema evolution for reliable communication in distributed systems.
- Apache Karaf
Apache Karaf is a lightweight, modular runtime designed for enterprise applications. It supports various frameworks and technologies, enabling flexible deployment on-premise or in the cloud. Karaf simplifies application management with features like dynamic configuration, remote access, clustering, and monitoring.
- Apache MADlib
Apache MADlib is a scalable machine learning library for in-database analytics. It runs advanced algorithms directly within PostgreSQL and Greenplum databases, enabling efficient data science workflows without moving data, and supports classification, regression, clustering, and deep learning.
- Apache James
Apache James is a modular, open-source mail server platform written in Java. It supports standard email protocols and offers extensible components for email processing, storage, and administration. Designed for flexibility and scalability, James can be deployed as a standalone or distributed mail server for enterprise-grade messaging.
- Apache Allura
Apache Allura is a web-based platform for managing software development projects. It supports source code repositories, issue tracking, wikis, blogs, and forums, enabling collaborative development and project management across multiple teams and tools in a unified environment.
- Apache XML Graphics
Apache XML Graphics is a project focused on converting XML data into graphical output. It includes tools like Apache Batik for SVG processing and Apache FOP for formatting XML documents into formats such as PDF, PostScript, and RTF.
- Apache Kafka
Apache Kafka is an open-source distributed event streaming platform used for building high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. It enables real-time data processing with scalability, fault tolerance, and low latency across diverse industries.
- Apache Airavata
Apache Airavata is a software framework for composing, managing, executing, and monitoring large-scale scientific applications and workflows on distributed computing resources such as clusters, supercomputers, grids, and clouds.
- Apache DataSketches
Apache DataSketches is a library of streaming algorithms for approximate analysis of big data. It enables fast, single-pass computation of queries like distinct counts, quantiles, and frequency estimation with mathematically proven error bounds, reducing processing time from hours to seconds in real-time and batch systems.
- Apache Cayenne
Apache Cayenne is an open-source Java ORM framework that simplifies database integration. It features a powerful GUI tool for mapping, supports reverse engineering, and enables transparent transactions and code generation for persistent objects.
- mod_perl
mod\_perl integrates the Perl programming language with the Apache HTTP server, enabling high-performance web applications. It embeds a persistent Perl interpreter in the server, reducing startup overhead and allowing dynamic content generation, advanced request handling, and seamless execution of existing CGI scripts.
- Apache Steve
Apache Steve is a Python-based voting system developed by the Apache Software Foundation. It supports various voting methods including single transferable vote, yes/no/abstain, and party-based preferential voting, and is used internally for board elections and other decision-making processes.
- Apache HertzBeat
Apache HertzBeat is an open-source, agentless real-time monitoring system with customizable templates, high-performance clustering, and Prometheus compatibility. It supports diverse targets including cloud-native, databases, OS, and networks, offering alerting, status pages, and secure, scalable monitoring across isolated and distributed environments.
- Apache Kerby
Apache Kerby is a Java-based Kerberos implementation that provides a lightweight, embeddable KDC and client library. It supports modern authentication methods like PKINIT, OTP, and OAuth2, making it suitable for cloud-native and big data environments requiring secure identity management.
- Apache ShardingSphere
Apache ShardingSphere is a distributed database middleware that enhances traditional databases with features like data sharding, read/write splitting, encryption, and governance. It supports heterogeneous databases and enables scalable, secure, and flexible data management across diverse environments.
- Apache Kudu
Apache Kudu is a distributed columnar storage engine designed for fast analytics on rapidly changing data, combining efficient columnar scans with low-latency inserts and updates for real-time analytical workloads.
- Apache BuildStream
Apache BuildStream is a software integration tool that automates the building and integration of software components, including operating systems. It supports declarative stack definitions, multiple build technologies, and output formats, enabling fast, reproducible, and customizable software delivery across diverse platforms.
- Apache Sedona
Apache Sedona is a cluster computing system that enhances platforms like Spark and Flink with distributed spatial datasets and SQL, enabling efficient processing and analysis of large-scale spatial data across machines.
- Apache Cocoon
Apache Cocoon is a Java-based web development framework focused on separation of concerns and component-based architecture. It uses XML pipelines to manage content, logic, and presentation, enabling scalable, modular, and maintainable web applications.
- Apache Syncope
Apache Syncope is an open-source system for managing digital identities in enterprise environments. It supports identity and access management through tools and protocols for authentication, authorization, and synchronization across diverse repositories, formats, and models.
- Apache DataFu
Apache DataFu is a set of libraries for large-scale data processing in Hadoop, offering stable, well-tested tools for data mining, statistics, and incremental computation across Spark, Pig, and MapReduce environments.
- Apache DeltaSpike
Apache DeltaSpike is a collection of portable CDI extensions for Java applications. It provides modular, ready-to-use components that enhance security, configuration, transactions, and JSF integration. DeltaSpike simplifies enterprise development by extending CDI capabilities and supporting multiple Java EE containers and environments.
- Apache HttpComponents
Apache HttpComponents is a set of Java libraries for building HTTP clients and servers. It supports HTTP/1.1 and HTTP/2, offering both blocking and non-blocking I/O models. Designed for performance and flexibility, it enables secure, scalable, and standards-compliant HTTP communication in Java applications.
- Apache OpenJPA
Apache OpenJPA is an open-source Java persistence framework that implements the Jakarta Persistence API. It enables object-relational mapping and data management in Java applications, supporting integration with Java EE containers and lightweight frameworks like Spring and Tomcat.
- Apache Subversion
Apache Subversion is an open-source version control system that manages changes to files and directories over time. It supports collaborative development with centralized repositories, enabling reliable tracking, branching, merging, and auditing of source code and digital assets across projects and teams.
- Apache SeaTunnel
Apache SeaTunnel is a high-performance, distributed data integration platform that supports real-time synchronization of massive datasets. It simplifies complex data workflows across various sources and engines, enabling stable, scalable, and efficient data movement for enterprise-grade applications.
- Apache JMeter
Apache JMeter is an open-source Java application for load testing and performance measurement. It simulates multiple users and protocols to evaluate system behavior under stress, making it ideal for testing web applications, APIs, databases, and other services in development and production environments.
- Apache POI
Apache POI is a Java library for reading and writing Microsoft Office file formats. It supports Office Open XML (OOXML) and OLE2 formats, enabling manipulation of Word, Excel, PowerPoint, and other document types. POI is widely used for document automation, data extraction, and integration in enterprise applications.
- Apache Spark
Apache Spark is a fast, open-source engine for large-scale data processing. It supports batch and streaming analytics, machine learning, and graph processing, offering high-level APIs and in-memory computation for efficient and scalable data workflows across distributed environments.
- Apache Amoro
Apache Amoro is a lakehouse management system designed to unify batch and streaming data processing. It supports multiple table formats like Iceberg, Paimon, and Hive, and integrates with engines such as Flink and Spark. Amoro optimizes storage and query performance through self-managing features.
- Apache Log4cxx
Apache Log4cxx is a logging framework for C++ applications, modeled after Log4j. It provides configurable, efficient logging with support for multiple output targets and formats, enabling developers to monitor and debug applications across platforms using the Apache Portable Runtime for portability.
- Apache OpenWebBeans
Apache OpenWebBeans is an open-source implementation of the Contexts and Dependency Injection (CDI) specification for Jakarta EE. It provides a modular, high-performance CDI container that supports Java SE and EE environments, enabling developers to build loosely coupled, maintainable, and scalable enterprise applications.
- Apache Phoenix
Apache Phoenix enables low-latency SQL access over HBase, supporting full ACID transactions, standard JDBC APIs, and dynamic schema management for OLTP and operational analytics in Hadoop environments.
- Apache Wicket
Apache Wicket is a Java web application framework that uses a component-based architecture to simplify development. It allows developers to build dynamic, secure, and maintainable web applications using plain Java and HTML, with strong support for reuse, testing, and integration.
- Apache Drill
Apache Drill is a schema-free SQL query engine for big data exploration. It enables high-performance analysis on semi-structured data without requiring predefined schemas, supporting standard SQL and integration with BI tools and various NoSQL and cloud storage systems.
- Apache ServiceComb
Apache ServiceComb is an open-source microservices solution that helps developers and enterprises build, deploy, and manage cloud-native applications efficiently. It integrates multiple components for service registration, configuration, transactions, and governance, supporting scalable and flexible microservice architectures.
- Apache Santuario
Apache Santuario is an open-source library that implements XML Signature and XML Encryption standards, enabling secure processing of XML data in Java applications through DOM and streaming APIs.
- Apache DB
Apache DB is a project of the Apache Software Foundation focused on creating and maintaining commercial-quality open-source database solutions. It provides freely available tools and technologies for relational and object-oriented data management, supporting developers with standards-based, scalable, and embeddable database components.
- Apache FtpServer
Apache FtpServer is a 100% Java-based FTP server designed to be portable, embeddable, and scalable. It supports standalone operation or integration into Java applications, with features like SSL/TLS, user management, and event handling via the Ftplet API.
- Apache Linkis
Apache Linkis builds a unified computation middleware layer that connects upper-level applications with various underlying data engines. It simplifies big data architecture by standardizing interfaces, enabling governance, orchestration, and resource control across engines like Spark, Flink, and Presto.
- Apache CarbonData
Apache CarbonData is a fully indexed columnar data format optimized for fast analytics on big data platforms. It supports advanced compression, multi-level indexing, and seamless integration with Spark, enabling efficient queries over petabytes of data.
- Apache InLong
Apache InLong is a one-stop, full-scenario integration framework for massive data. It supports data ingestion, synchronization, and subscription with secure, reliable transmission. InLong handles both batch and stream processing, enabling real-time analytics, modeling, and scalable data applications across diverse industries.
- Apache Portals
Apache Portals is an open-source project focused on developing robust, full-featured portal software. It provides a unified platform for integrating web services and applications, enabling flexible deployment across intranet, extranet, and internet environments. Portals support modular, scalable, and customizable enterprise solutions.
- Apache PyLucene
Apache PyLucene is a Python extension that enables access to Java Lucene’s full-text indexing and search capabilities. It embeds a Java Virtual Machine into a Python process, allowing Python applications to leverage Lucene’s powerful search features through a machine-generated Python module.
- Apache Guacamole
Apache Guacamole is a clientless remote desktop gateway that allows access to computers via standard protocols like VNC, RDP, and SSH. It requires no plugins or client software, functioning entirely through a web browser using HTML5.
- Apache Any23
Apache Any23 is a library, web service, and command-line tool that extracts structured RDF data from diverse web documents. It supports formats like RDFa, Microformats, JSON-LD, HTML5 Microdata, CSV, and YAML, enabling seamless conversion of web content into machine-readable triples.
- Apache Superset
Apache Superset is a modern, enterprise-ready business intelligence web application. It enables data exploration and visualization through a no-code chart builder, a powerful SQL editor, and a lightweight semantic layer. Superset integrates with nearly any SQL database or data engine and supports scalable, cloud-native deployments.
- Apache GraphAr
Apache GraphAr is a graph data format designed for efficient storage and retrieval of large-scale graph datasets. It supports cross-language access, out-of-core processing, and integration with tools like Apache Arrow, enabling scalable graph analytics in distributed and cloud-native environments.
- Apache Jena
Apache Jena is a Java framework for building Semantic Web and Linked Data applications. It provides tools for working with RDF data, SPARQL queries, and ontologies, enabling developers to model, store, query, and reason over semantic data using W3C standards.
- Apache Livy
Apache Livy is a REST-based service for interacting with Apache Spark, enabling remote job submission, result retrieval, and Spark context management. It supports multi-user, fault-tolerant access from web or mobile apps without requiring Spark client installations.
- Apache Cassandra
Apache Cassandra is a distributed NoSQL database designed for high availability and scalability. It handles large volumes of data across multiple servers with no single point of failure, making it ideal for mission-critical applications requiring fault tolerance and low-latency performance.
- Apache MXNet
Apache MXNet is a flexible and efficient open-source deep learning framework designed for research and production. It supports both symbolic and imperative programming, enabling scalable training and deployment of deep neural networks across multiple platforms and languages.
- Apache Pekko
Apache Pekko is an open-source framework for building concurrent, distributed, resilient, and elastic applications. It uses the Actor Model to simplify concurrency and includes libraries for persistence, streams, HTTP, and more. Designed for the JVM, it supports Java and Scala and is a fork of Akka 2.6.x.
- Apache Continuum
Apache Continuum is an enterprise-ready continuous integration server that automates builds, manages releases, and integrates with popular build tools and version control systems, helping teams maintain consistent and reliable software delivery processes.
- Apache DevLake
Apache DevLake is an open-source platform that aggregates and analyzes data from DevOps tools to provide insights into software development processes, helping teams improve productivity, track metrics like DORA, and make data-driven decisions.
- Apache Yetus
Apache Yetus is a set of tools and libraries that support contribution and release processes for software projects. It automates patch testing, interface documentation, and release note generation, helping maintain code quality and consistency across large, collaborative development environments.
- Apache Directory Studio
Apache Directory Studio is an Eclipse-based LDAP browser and directory client designed for use with any LDAP server, especially ApacheDS. It provides tools for browsing, editing, and managing LDAP directories, schemas, and configurations in a multi-platform environment.
- Apache StreamPark
Apache StreamPark is a cloud-native platform that simplifies real-time stream processing using Apache Flink and Spark. It offers easy deployment, plug-and-play connectors, and unified management for streaming and batch jobs.
- Apache Aries
Apache Aries is a modular Java framework that implements OSGi enterprise specifications to support dynamic, service-oriented application development. It enables the creation of isolated, pluggable components and applications that can run across various OSGi-based environments with support for dependency injection, transactions, and modular packaging.
- Apache Storm
Apache Storm is a distributed real-time computation system for processing large streams of data quickly and reliably. It supports real-time analytics, machine learning, and continuous computation, offering scalability, fault tolerance, and integration with various data sources and processing tools.
- Apache JDO
Apache JDO (Java Data Objects) is a standard API for transparent persistence of Java objects. It allows developers to store and retrieve POJOs in various datastores including relational databases, object databases, and file systems, promoting separation of concerns and database independence.
- Apache AGE
Apache AGE is a PostgreSQL extension that enables graph database functionality, allowing users to model and query data as nodes and edges while retaining full SQL capabilities. It supports hybrid queries using SQL and Cypher, making it ideal for analyzing complex relationships within relational databases.
- Apache Woden
Apache Woden is a Java library for reading, writing, and manipulating WSDL documents. It supports WSDL 2.0 and aims to handle past and future versions. Designed for high performance, it provides APIs and implementations usable in other Apache projects like Axis2.
- Apache IoTDB
Apache IoTDB is a low-cost, high-performance time-series database designed specifically for IoT applications. It efficiently handles massive volumes of time-series data with high sampling frequencies, frequent disorder, and diverse analytical needs. IoTDB supports real-time stream processing, high compression, and scalable architecture for industrial-scale deployments.
- Apache ActiveMQ
Apache ActiveMQ is an open-source, multi-protocol, Java-based message broker that enables asynchronous communication between applications. It supports various messaging protocols and client languages, making it suitable for scalable, distributed systems and enterprise integration patterns.
- Apache Creadur
Apache Creadur is a top-level project of the Apache Software Foundation that provides tools for auditing and managing legal documentation in software distributions. It includes utilities like Apache Rat, Tentacles, and Whisker, each designed to help ensure license compliance and metadata accuracy across open-source projects.
- Apache Zeppelin
Apache Zeppelin is a web-based notebook for interactive data analytics and collaborative documents supporting multiple languages like SQL, Python, Scala, and R. It integrates with Apache Spark and other backends, enabling data visualization, dynamic forms, and real-time collaboration.
- Apache OpenNLP
Apache OpenNLP is a machine learning-based toolkit for processing natural language text. It supports tasks such as sentence segmentation, tokenization, part-of-speech tagging, named entity recognition, parsing, and coreference resolution, enabling developers to build intelligent language-aware applications.
- Apache Maven Doxia
Apache Doxia is a content generation framework for creating static and dynamic documentation. It supports multiple lightweight markup languages and is used extensively by Apache Maven to render project documentation. Doxia enables developers to produce readable and maintainable content across various formats.
- Apache BookKeeper
Apache BookKeeper is a scalable, fault-tolerant, low-latency storage service optimized for real-time workloads. It ensures durability, replication, and strong consistency, making it ideal for building reliable distributed systems and applications that require high-performance log storage.
- Apache Felix
Apache Felix is an open-source implementation of the OSGi framework specification, enabling modular Java application development. It supports dynamic deployment and lifecycle management of components, making it suitable for embedded systems, desktop applications, and enterprise solutions requiring flexible and scalable architecture.
- Apache EventMesh
Apache EventMesh is a serverless event middleware platform designed for building distributed event-driven applications. It supports event orchestration, schema management, and integration with various messaging systems, enabling scalable, flexible, and reliable communication across microservices and cloud-native environments.
- Apache Xalan
Apache Xalan is a software library that transforms XML documents using XSLT stylesheets. It supports both Java and C++ implementations and enables conversion of XML data into HTML, text, or other XML formats for data presentation and integration.
- Apache Incubator
Apache Incubator is the gateway for new projects to join the Apache Software Foundation. It provides mentorship, infrastructure, and governance guidance to help open-source initiatives grow sustainable communities and meet Apache standards before becoming top-level projects.
- Apache UIMA
Apache UIMA is an open-source framework for analyzing unstructured data such as text, audio, and video. It enables developers to build modular pipelines that extract structured information, supporting scalable and interoperable processing across languages and platforms.
- Apache Camel
Apache Camel is an open-source integration framework that enables seamless data exchange between diverse systems using enterprise integration patterns and over 300 components.
- Apache Polaris
Apache Polaris is an open-source metadata catalog for Apache Iceberg tables, enabling secure, centralized access and governance across multiple query engines and cloud platforms using the Iceberg REST API.
- Apache DataFusion
Apache DataFusion is a fast, extensible query engine written in Rust using Apache Arrow. It provides SQL and DataFrame APIs, supports multiple file formats, and features a vectorized, multi-threaded execution engine. DataFusion is ideal for building high-performance, data-centric systems and analytics platforms.
- Apache SpamAssassin
Apache SpamAssassin is an open-source anti-spam platform that filters and classifies email to block unsolicited bulk messages. It uses a scoring framework and plug-ins for heuristic and statistical analysis, including Bayesian filtering, DNS blocklists, and collaborative filtering, ensuring flexible and effective spam detection.
- Apache DirectMemory
Apache DirectMemory is an off-heap cache for the Java Virtual Machine, designed to store large volumes of objects outside the heap to reduce garbage collection overhead and improve application performance.
- Apache Thrift
Apache Thrift is a software framework for scalable cross-language services development. It combines a code generation engine with a runtime library to enable efficient and seamless communication between applications written in different programming languages using remote procedure calls.
- Apache Curator
Apache Curator is a Java/JVM client library for Apache ZooKeeper, a distributed coordination service. It includes a high-level API framework and utilities to make using ZooKeeper easier and more reliable. It also provides recipes for common use cases and extensions like service discovery and asynchronous DSL.
- Apache Helix
Apache Helix is a generic cluster management framework for distributed systems. It automates resource assignment, node failure recovery, load balancing, and reconfiguration across partitioned and replicated resources, enabling scalable, fault-tolerant operations with minimal custom code.
- Apache Knox
Apache Knox is a gateway that provides a single access point for interacting with REST APIs and UIs of Apache Hadoop clusters, offering perimeter security, authentication, and simplified access management for enterprise deployments.
- Apache Jackrabbit
Apache Jackrabbit is a Java-based content repository that implements the JCR API. It provides hierarchical storage, versioning, querying, and access control for structured and unstructured content, making it ideal for content management systems and applications requiring flexible, standards-compliant data storage.
- Apache Fineract
Apache Fineract is an open-source core banking platform that enables financial institutions to deliver digital financial services. It supports loan and savings products, real-time accounting, and API-driven integration, making it ideal for microfinance, fintech, and banking applications focused on scalability and financial inclusion.
- Apache Log4j
Apache Log4j is a powerful and flexible Java-based logging framework designed for high-performance, reliable, and extensible logging in enterprise applications. It separates API from implementation, supports multiple output formats and destinations, and enables dynamic configuration and plugin-based customization.
- Apache Celeborn
Celeborn is an intermediate data service for Big Data compute engines like ETL, OLAP, and streaming systems, designed to boost performance, stability, and flexibility by managing shuffle and spilled data efficiently.
- Apache Kyuubi
Apache Kyuubi is a distributed, multi-tenant gateway that provides serverless SQL access to data lakehouses. It enables high-performance analytics by abstracting complex backend engines like Apache Spark, Flink, and Trino, offering a unified SQL interface for interactive and batch workloads across diverse data sources.
- Apache Juneau
Apache Juneau is a lightweight Java framework for serializing and parsing data formats and building RESTful web services. It supports multiple formats like JSON, XML, and HTML, and simplifies API development with annotation-based configuration and modular components for client-server communication.
- Apache Parquet
Apache Parquet is an open-source, column-oriented data file format optimized for efficient data storage and retrieval. It supports high-performance compression and encoding schemes, making it ideal for handling complex, large-scale data across various programming languages and analytics tools.
- Apache YuniKorn
Apache YuniKorn is a universal resource scheduler for Kubernetes that enables efficient, fair, and flexible scheduling of batch and streaming workloads in cloud-native environments, supporting multi-tenancy and hierarchical queues for better resource management.
- Apache APISIX
Apache APISIX is a cloud-native, open-source API Gateway and AI Gateway designed for high-performance traffic management. It supports dynamic routing, plugin hot-loading, and advanced security features, enabling scalable and secure microservices communication and AI workload optimization across diverse platforms.
- Apache Doris
Apache Doris is a modern, open-source data warehouse designed for real-time analytics at scale. Built on a massively parallel processing architecture, it delivers lightning-fast queries, supports high concurrency, and integrates seamlessly with data lakes and streaming platforms for unified, low-latency analytical workloads.
- Apache OFBiz
Apache OFBiz is an open-source suite of enterprise business applications designed for automation across industries. It includes a powerful Java-based web framework and a set of integrated modules for ERP, CRM, e-commerce, manufacturing, and more. OFBiz is highly customizable and scalable for diverse business needs.
- Apache Mynewt
Apache Mynewt is a real-time operating system designed for microcontroller-based devices. It supports secure, scalable, and modular development for IoT applications, offering networking stacks like BLE, LoRaWAN, and TCP/IP, along with tools for firmware management, diagnostics, and cross-platform compatibility.
- Apache Crunch
Apache Crunch is a Java library for creating data pipelines on Hadoop, simplifying complex MapReduce tasks with a high-level API for joins, aggregations, and transformations across structured and semi-structured data.
- Apache .NET Ant Library
The Apache .NET Antlib enables building and testing .NET applications using Apache Ant. It supports cross-platform development with tasks for NUnit, NAnt, MSBuild, and WiX, and adapts execution for Mono or Microsoft .NET environments.
- Apache Tobago
Apache Tobago is a UI component library for JavaServer Faces (JSF) that simplifies the creation of modern, responsive web applications. It provides a rich set of reusable components with built-in theming and layout management, reducing the need for manual HTML, CSS, and JavaScript coding.
- Apache Pinot
Apache Pinot is a real-time distributed OLAP datastore designed for ultra low-latency analytics at high throughput. It supports both batch and streaming data ingestion, enabling fast, scalable, and cost-effective analytics for user-facing applications and dashboards.
- Apache Vysper
Apache Vysper is a modular XMPP server that supports core messaging and presence protocols. It enables real-time communication using open standards and can be embedded into Java applications or run standalone, making it suitable for custom and federated messaging solutions.
- Apache Gravitino
Apache Gravitino is a geo-distributed, federated metadata lake that unifies metadata management and governance across diverse data and AI assets, supporting multi-engine access and direct integration with various sources.
- Apache Celix
Apache Celix is a framework based on the OSGi specification, adapted for C and C++. It enables dynamic modular software development using component and service-oriented programming, supporting runtime-installed bundles and in-process service interaction.
- Apache HTTP Server
Apache HTTP Server is a widely used open-source web server that delivers high-performance, secure, and flexible HTTP services. It supports dynamic content, modular architecture, and cross-platform deployment, making it suitable for personal websites, enterprise applications, and cloud environments.
- Apache Ranger
Apache Ranger is a security framework for managing data access policies across big data platforms. It provides centralized authorization, fine-grained control, and auditing to ensure secure and compliant data usage in enterprise environments.
- Apache Kvrocks
Apache Kvrocks is a distributed key-value NoSQL database that uses RocksDB as its storage engine and supports the Redis protocol for high-performance, scalable data management.
- Apache Pig
Apache Pig is a high-level platform for processing large data sets using Hadoop. It simplifies data analysis through its scripting language Pig Latin, allowing users to write complex data transformations without needing to code in MapReduce.
- Apache CXF
Apache CXF is an open-source framework for building and deploying web services using standards like SOAP and REST. It supports multiple protocols and transports, enabling secure, scalable, and flexible service-oriented applications in Java.
- Apache Maven
Apache Maven is a build automation and project management tool for Java-based applications. It uses a declarative XML configuration to manage dependencies, compile code, run tests, and package projects, promoting consistency, scalability, and reproducibility across development environments.
- Apache Paimon
Apache Paimon is a lakehouse storage format that supports real-time and batch processing with engines like Flink and Spark. It combines a lake format with LSM structure to enable real-time streaming updates, flexible data management, and efficient querying for large-scale data architectures.
- Apache Tomcat
Apache Tomcat is an open-source Java servlet container and web server that implements Jakarta EE specifications. It enables developers to deploy and manage Java-based web applications efficiently, offering high performance, scalability, and reliability for enterprise and cloud environments.
- Apache Brooklyn
Apache Brooklyn is a framework for modeling, deploying, and managing applications across cloud and non-cloud environments. It uses blueprints to define application architectures and autonomic policies to monitor and adapt systems in real time, enabling scalable, resilient, and portable application operations.
- Apache Cordova
Apache Cordova is an open-source mobile development framework that enables building cross-platform apps using HTML5, CSS3, and JavaScript. It wraps web applications in native containers, allowing access to device features through standardized APIs for deployment across multiple mobile platforms.