Skip to content
View teknium1's full-sized avatar

Sponsors

@gilesc
@StarKeyJON
@loweroctave
@shea256
@chrislengerich
@LastMileNow
@geohot
@enricoros
@iharabukhouski
@jshuadvd
@lsternlicht
@Myth727
@doziedotdev

Block or report teknium1

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
teknium1/README.md

Hello, I'm Teknium1 πŸ‘‹

I'm a Python Programmer, AI Enthusiast, and a Co-founder of NousResearch.

My work primarily involves AI and Data Engineering, contributing primarily by releasing open source Large Language Model (LLMs), datasets, synthetic data pipelines, and RL environments.

πŸš€ My Work

πŸ’Ό Nous Research

I've contributed significantly to the development of several opensource LLMs under Nous Research.

Here are a couple of them:

πŸš€ Personal Projects

On my personal huggingface, Teknium, I have released several models, including my work on Replit-3b Model & OpenHermes:

πŸ’» Github Projects

I've been part of several intriguing projects on GitHub. Here are a few of them:

  • LLM-Benchmark-Logs - A repository full of benchmarks I've done on various LLMs, originally inside of Nous' discord but it became too disorganized, so now lives on Github.
  • LLM-Logbook - A temporary project that became too expensive to do, collection of responses for 100 random crowdsourced prompts to various LLMs.
  • GPTeacher - A collection of modular datasets generated by GPT-4, for training LLMs.
  • RawTransform - A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
  • stanford_alpaca-replit - Modified Stanford-Alpaca Trainer for Training Replit's Code Model.
  • alpaca-roleplay-discordbot - An LLM discord bot that roleplays!
  • alpaca-discord - A Simple Discord Bot for the Alpaca LLM.

πŸ’Ό CarperAI / StabilityAI

Have worked on researching, planning ablations, and cleaning/filtering the dataset for:

Both are 10% Orca replications trained on Llama-1 and Llama-2 70B. Also working on domain expert knowledge and task distillation.

πŸ’Ό Open Orca

Working with the Open Orca team on data cleaning, networking, ablations, and more:

πŸ“« Get in Touch

Popular repositories Loading

  1. GPTeacher GPTeacher Public

    A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer

    Python 1.6k 166

  2. Prompt-Engineering-Toolkit Prompt-Engineering-Toolkit Public

    JavaScript 423 35

  3. alpaca-roleplay-discordbot alpaca-roleplay-discordbot Public

    A discord bot that roleplays!

    Python 152 17

  4. LLM-Benchmark-Logs LLM-Benchmark-Logs Public

    Just a bunch of benchmark logs for different LLMs

    121 2

  5. ShareGPT-Builder ShareGPT-Builder Public

    Python 120 19

  6. alpaca-discord alpaca-discord Public

    A Simple Discord Bot for the Alpaca LLM

    Python 98 8