๐ Home
๐ญ About
๐บ Programs
Overview
๐งช Open Source Research Experience
๐งช Summer of Reproducibility
๐ชบ Open Source Incubator Fellowship
๐ Open Source Education
๐ Resources
๐ Blog
๐ช Events
2022 Projects
Return to
2023 projects
*
All
UC Mentors
Reproducibility
Hide
Adaptive Load Balancers for Low-latency Multi-hop Networks
This project aims at designing efficient, adaptive link level load balancers for networks that handle different kinds of traffic, in particular networks where flows are heterogeneous in terms of their round trip times.
Katia Obraczka
Apache AsterixDB
AsterixDB is an open source parallel big-data management system. AsterixDB is a well-established Apache project that has beedddn active in research for more than 10 years. It provides a flexible data model that supports modern NoSQL applications with a powerful query processor that can scale to billions of records and terabytes of data.
Ahmed Eldawy
CephFS
CephFS is a distributed file system on top of Ceph. It is implemented as a distributed metadata service (MDS) that uses dynamic subtree balancing to trade parallelism for locality during a continually changing workloads.
Carlos Maltzahn
DirtViz (2022)
DirtViz is a project to visualize data collected from sensors deployed in sensor networks. We have deployed a number of sensors measuring qualities like soil moisture, temperature, current and voltage in outdoor settings.
Colleen Josephson
Eusocial Storage Devices
As storage devices get faster, data management tasks rob the host of CPU cycles and main memory bandwidth. The Eusocial project aims to create a new interface to storage devices that can leverage existing and new CPU and main memory resources to take over data management tasks like availability, recovery, and migrations.
Jianshen Liu
,
Carlos Maltzahn
FasTensor
FasTensor is a parallel execution engine for user-defined functions on multidimensional arrays. The user-defined functions follow the stencil metaphor used for scientific computing and is effective for expressing a wide range of computations for data analyses, including common aggregation operations from database management systems and advanced machine learning pipelines.
John Wu
HDF5
HDF5 is a unique technology suite that makes possible the management of extremely large and complex data collections. The HDF5 technology suite includes: A versatile data model that can represent very complex data objects and a wide variety of metadata.
Suren Byna
,
Houjun Tang
LiveHD (2022)
Projects for
LiveHD
. Lead Mentors:
Jose Renau
and
Sheng-Hong Wang
.
Jose Renau
Open Source Autonomous Vehicle Controller
The OSAVC is a vehicle-agnostic open source hardware and software project. This project is designed to provide a real-time hardware controller adaptable to any vehicle type, suitable for aerial, terrestrial, marine, or extraterrestrial vehicles.
Aaron Hunter
OpenRAM
OpenRAM is an award winning open-source Python framework to create the layout, netlists, timing and power models, placement and routing models, and other views necessary to use SRAMs in ASIC design.
Matthew Guthaus
OpenROAD - A Complete, Autonomous RTL-GDSII Flow for VLSI Designs
OpenROAD is a front-runner in open-source semiconductor design automation tools and know-how. OpenROAD reduces barriers of access and tool costs to democratize system and product innovation in silicon. The OpenROAD tool and flow provide an autonomous, no-human-in-the-loop, 24-hour RTL-GDSII capability to support low-overhead design exploration and implementation through tapeout.
Matthew Guthaus
Package Management & Reproducibility
Project ideas related to reproducibility and package management, especially as it relates to store type package managers (NixOS, Guix or Spack). Lead Mentor: Farid Zakaria mailto:fmzakari@ucsc.edu Investigate the dynamic linking landscape Topics: Operating Systems Compilers Linux Package Management NixOS Skills: Experience with systems programming and Linux familiarity Difficulty: Moderate to Challenging Size: Large (350 hours) Mentors: Farid Zakaria & Tom Scogland mailto:scogland1@llnl.
Farid Zakaria
Polyphorm / PolyPhy
Polyphorm is an agent-based system for reconstructing and visualizing optimal transport networks defined over sparse data. Rooted in astronomy and inspired by nature, we have used Polyphorm to reconstruct the Cosmic web structure, but also to discover network-like patterns in natural language data.
Oskar Elek
Proactive Data Containers (PDC)
Proactive Data Containers (PDC) are containers within a locus of storage (memory, NVRAM, disk, etc.) that store science data in an object-oriented manner. Managing data as objects enables powerful optimization opportunities for data movement and transformations, and storage mechanisms that take advantage of the deep storage hierarchy and enable automated performance tuning
Suren Byna
,
Houjun Tang
Skyhook Data Management
SkyhookDM The Skyhook Data Management project extends object storage with data management functionality for tabular data. SkyhookDM enables storing and query tabular data in the Ceph distributed object storage system. It thereby turns Ceph into an Apache Arrow-native storage system, utilizing the Arrow Dataset API to store and query data with server-side data processing, including selection and projection that can significantly reduce the data returned to the client.
Jayjeet Chakraborty
Cite
×