data-indexing

Star

Here are 16 public repositories matching this topic...

cocoindex-io / cocoindex

Star

Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

Updated Nov 30, 2025
Rust

saturn-lab / BDMI-2019A

Star

Big Data and Machine Intelligence Course in Autumn 2019.

machine-learning database data-structures algorithms-and-data-structures data-indexing

Updated Jan 15, 2020
Jupyter Notebook

cocoindex-io / patient-intake-extraction

Star

Patient Intake Form Extraction using llm

machine-learning ocr etl healthcare data-indexing rag llm

Updated May 29, 2025
Python

TopTrenDev / solana-dex-data-indexer-substream

Star

🧠 Solana DEX Swap Data Indexer Substream-powered swap indexer for Solana — supports Pump.fun, PumpSwap, BonkFun, Meteora, Raydium, Orca & more. ⚡📊🔥 Designed for real-time trade analytics, MEV research, and on-chain data pipelines. 📡

orca dex substream data-indexing solana raydium data-indexer meteora pumpfun pumpswap letsbonk letsbonkfun bonkfun

Updated Aug 21, 2025
Rust

most-inesctec / I2Bplus-tree

Star

🌲 Improved Interval B+ tree implementation, in TS 🌲

tree analysis complexity interval-tree intervals temporal-data b-plus-tree data-indexing time-efficient logarithmic-complexity valida-time-data indexing-structure i2b-tree

Updated Dec 28, 2021
TypeScript

fshnkarimi / Similar-Paper-Reccomendation

Star

This repository contains an application designed to recommend scientific papers that are most similar to a given input paragraph. The application uses the llama and weaviate libraries to achieve this.

python llama gpt data-indexing weaviate streamlit vector-database large-language-models llm

Updated Oct 25, 2023
Jupyter Notebook

tangentlin / indexed-collection

Star

A zero-dependency library of classes that make filtering, sorting and observing changes to arrays easier and more efficient.

javascript data typescript index collectionview data-indexing

Updated Jul 12, 2025
TypeScript

datafast-network / datafast-runtime

Star

Datafast Runtime is a high-performance subgraph processing runtime which is written from scratch and designed to handle subgraphs with unparalleled speed & storage-efficiency

blockchain diy subgraph data-indexing thegraphprotocol subgraphs thegraph

Updated Nov 28, 2024
Rust

iron-hope-shop / bords-portfolio

Star

BORDS is an open-access reaction search engine that leverages Google's Open Reaction Database to provide ultra-fast, comprehensive access to millions of chemical reactions. Built with a modern cloud stack, it streamlines reaction data extraction, transformation, and indexing for researchers in chemistry and related fields.

react flask etl google-cloud open-access elastic-search chemical-reactions ord data-indexing reaction-search

Updated Feb 17, 2025
JavaScript

SciGaP / seagrid-data

Star

System for Managing the data generated by the SEAGrid Science Gateway

querying scientific-data data-indexing

Updated Sep 17, 2022
Java

Md-Emon-Hasan / Vector-Database

Star

Designed to store and retrieve high-dimensional data, such as embeddings, efficiently. It enables fast similarity searches by leveraging techniques.

Updated Jan 31, 2025
Jupyter Notebook

dappros / rag_demos

Sponsor

Star

Examples of RAG (Retrieval-Augmented Generation) with Ethora, LangChain, and OpenAI. Build knowledge-based AI assistants fast. Powered by Ethora Chat Component.

Updated Oct 29, 2025
Python

paocarvajal1912 / Forecasting_Net_Prophet

Star

Time series analysis showing trend, seasonality, and periodicity decomposition; and forecasting using Facebook Prophet. The analysis makes extensive use of indexing data tools and of the Pandas and Datetime libraries.

python time-series datetime pandas data-analysis trends facebook-prophet-forecasting data-indexing periodicity-analysis seasonality-analysis

Updated May 8, 2022
Jupyter Notebook

ahenrij / univ-rennes1-m2-inv-search-engine

Star

Python implementation of a TF-IDF/cosine based search engine

python search-engine data indexation data-indexing

Updated Nov 17, 2021
Python

hamzakhan0712 / FlaskSearch-API

Star

RESTful search API built with Flask and Elasticsearch. Features full-text search, data indexing, and query capabilities for Shakespeare plays dataset with scalable architecture and production-ready implementation.

Updated Nov 12, 2025
Python

Atiq-Data / Modern_Data_Warehouse

Star

A comprehensive guide to building a modern data warehouse using medallion Data Warehouse Architecture with SQL Server, including ETL processes, data modeling, and analytics.

data-science database data-engineering data-lake data-analysis data-integration sqlserver data-modeling data-normalization data-cleaning star-schema etl-pipeline data-indexing data-aggregator data-warehouse-architecture data-lakehouse

Updated Jul 29, 2025
TSQL

Improve this page

Add a description, image, and links to the data-indexing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-indexing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-indexing

Here are 16 public repositories matching this topic...

cocoindex-io / cocoindex

saturn-lab / BDMI-2019A

cocoindex-io / patient-intake-extraction

TopTrenDev / solana-dex-data-indexer-substream

most-inesctec / I2Bplus-tree

fshnkarimi / Similar-Paper-Reccomendation

tangentlin / indexed-collection

datafast-network / datafast-runtime

iron-hope-shop / bords-portfolio

SciGaP / seagrid-data

Md-Emon-Hasan / Vector-Database

dappros / rag_demos

paocarvajal1912 / Forecasting_Net_Prophet

ahenrij / univ-rennes1-m2-inv-search-engine

hamzakhan0712 / FlaskSearch-API

Atiq-Data / Modern_Data_Warehouse

Improve this page

Add this topic to your repo