Skip to content
View gilesc's full-sized avatar
  • Oklahoma Medical Research Foundation
  • Oklahoma City, OK
  • @cbgiles

Sponsoring

@teknium1

Organizations

@wrenlab

Block or report gilesc

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An open infrastructure to democratize and decentralize the development of superintelligence for humanity.

Rust 653 98 Updated Mar 24, 2026
Python 991 85 Updated Jan 25, 2026

Convert PDF to markdown + JSON quickly with high accuracy

Python 33,230 2,304 Updated Mar 10, 2026

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases.

Jupyter Notebook 524 62 Updated Mar 27, 2026

LangChain 🔌 MCP

Python 3,455 391 Updated Apr 1, 2026

A Model Context Protocol (MCP) server that provides LLMs with real-time access to scientific papers from arXiv and OpenAlex.

TypeScript 46 7 Updated Aug 14, 2025

[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Python 127 7 Updated Mar 19, 2026

Implementation for OAgents: An Empirical Study of Building Effective Agents

Python 315 24 Updated Oct 13, 2025

The raw UserRL repo under construction

Python 97 9 Updated Sep 25, 2025

A self-hostable bookmark-everything app (links, notes and images) with AI-based automatic tagging and full text search

TypeScript 24,414 1,122 Updated Mar 29, 2026

[KDD'2026] "VideoRAG: Chat with Your Videos"

Python 2,826 403 Updated Mar 18, 2026

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 31,335 4,480 Updated Mar 30, 2026

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,792 4,774 Updated Mar 31, 2026

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

Python 850 80 Updated Mar 27, 2026

CRDT-based offline-first sync for SQLite. Syncs automatically with SQLite Cloud, PostgreSQL, and Supabase. No conflicts, no data loss, no backend to build. For offline-first apps and AI agents.

C 436 13 Updated Apr 1, 2026

Scripts for harvesting from repositories using OAI-PMH

Python 9 1 Updated Aug 18, 2025

SPARQL graph database

Rust 1,574 117 Updated Mar 31, 2026

RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.

Python 2,425 589 Updated Mar 30, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,973 252 Updated Mar 16, 2026

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,037 233 Updated Feb 9, 2026

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,011 329 Updated Aug 14, 2025

Make text LLMs listen and speak

Python 1,257 218 Updated Mar 26, 2026

A simple, extendable, and clean backtesting framework for portfolio allocation problems (and more).

Python 72 18 Updated Oct 26, 2025

A dialect of Lisp that's embedded in Python

Python 5,443 379 Updated Mar 21, 2026

Fast inference engine for Transformer models

C++ 4,400 461 Updated Feb 4, 2026

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 342 75 Updated Apr 1, 2025

Build local voice agents with open-source models

Python 4,627 535 Updated Mar 31, 2026

An interactive explorer for single-cell transcriptomics data

JavaScript 756 148 Updated Mar 29, 2026

a decentralized dataset generator and manipulator.

Python 18 3 Updated Apr 1, 2026

Distributed Training Over-The-Internet

987 48 Updated Oct 14, 2025
Next