rl-list.com · Vendors · General Reasoning

General Reasoning

Open source

medium confidence

www.gr.inc ↗ · Status: active confirmed · Founded 2025

General Reasoning is an AI research lab (operating research hub in London; legal entity General Reasoning, Inc. registered in the US) building open RL environments and infrastructure for training and evaluating agents over long horizons. Its OpenReward platform and Open Reward Standard (ORS) provide an open specification for connecting language models to community-built RL environments, with 330+ environments accessible through one API.

Key facts

Headquarters

London, United Kingdom (Shoreditch), operating research hub; legal entity General Reasoning, Inc. registered in San Francisco/US per SEC Form Dconfirmed cite

Headcount band

11-50reported cite

Total raised

$10.9Mreported cite

Last round

Equity offering, $10,904,992 sold, filed 2025-07-11 (SEC Form D; stage not labeled, reported as ~2025 seed-stage by Crunchbase)reported cite

SOC 2

unknown

What they sell

environmentsconfirmed cite

Open source

yesconfirmed cite

Deployment

managed-hosted / self-hosted / APIconfirmed cite

Scale & velocity

Current headcount

~10 (LinkedIn public snippet; '11-50' band)reported cite

Headcount growth

unknown

Open roles

3confirmed cite

Other locations

unknown cite

Distributed / remote

noestimated cite

Research depth

Has researchers

yesconfirmed cite

Researcher count

~6 named individuals on site (founders + research/eng staff); counted from research/team pageestimated cite

Backgrounds

Ross Taylor (co-founder/CEO) - ex-Meta AI/FAIR, research lead on Galactica, led reasoning for Llama 2/Llama 3; co-founded Papers with Code (acquired by Meta), Founding team previously led open language model development at Meta (Galactica, Llama 2, Llama 3), Other named staff/directors: Kip Parker, Chengxi Wang, Thomas Grady, Iliyan Zarov, Henry Coursereported cite

Papers / benchmarks

KellyBench: long-horizon sequential decision-making benchmark - https://www.gr.inc/releases/introducing-kellybench, OpenReward / Open Reward Standard - https://www.gr.inc/releases/introducing-openreward, Galactica, Llama 2, Llama 3 (founders' prior work at Meta)confirmed cite

Capital

Total raised

~$10.9M (amount sold per SEC Form D 2025-07-11; corroborated by Crunchbase)reported cite

Last round

Equity offering, $10,904,992 sold, filed 2025-07-11 (SEC Form D; stage not labeled, reported as ~2025 seed-stage by Crunchbase)reported cite

Investors

unknown

Valuation

unknown

Revenue signals

unknown

Security & compliance

SOC 2

unknown

Other certifications

unknown

Security page

unknown

Product

What they sell

environmentsconfirmed cite

Open source

yesconfirmed cite

License

Apache-2.0 (firehorse harness repo); Open Reward Standard described as open-source, specific license not statedreported cite

Deployment model

managed-hosted / self-hosted / APIconfirmed cite

Maturity

GAestimated cite

Notable customers

⚑NVIDIA self-claimed Nebius self-claimed Eigent self-claimed ⚑OpenAI self-claimed ⚑Meta self-claimed cite

Buyer analysis

Best fit: Teams wanting open, portable RL environments and long-horizon agent benchmarks they can run anywhere (self-hosted) or via managed/API hosting.

How we verified this

Confirmed this is the correct entity matching the 'research grade, open-leaning' note: Ross Taylor's General Reasoning / gr.inc, building open RL environments (OpenReward, Open Reward Standard), distinct from the same-named 'General Analysis' (a separate $10M-seed agentic-security startup) and 'General Intuition' (a $133.7M gaming-RL lab), neither of which the draft confused. Verified founding year 2025 across LinkedIn, careers page, and SEC Form D. Headcount ~10 / 11-50 band confirmed via LinkedIn snippet, corrected the draft's implication of larger scale is not an issue (already small). Key downgrades: total_raised and last_round moved from 'confirmed' to 'reported' since the SEC Form D reports amount sold in a single offering (not lifetime total) with no stage label; Crunchbase corroborates ~$10.9M (Aug 2025). No investors are disclosed anywhere credible, kept unknown. notable_customers corrected: NVIDIA/Nebius/Eigent (and the previously-omitted OpenAI and Meta) appear on the vendor's own OpenReward page as environment providers/contributors, NOT verified paying customers, so all remain self-claimed; flagged frontier-lab ties for NVIDIA, OpenAI, and Meta. HQ clarified as London operating hub with US-registered legal entity (General Reasoning, Inc., San Francisco). SOC2/certifications/valuation/revenue correctly left unknown. Overall confidence: medium.

Related vendors

Sources

www.gr.inc/ · 2026-06-07, Official homepage - positioning, products, founding team ex-Meta, contact, social links
www.gr.inc/careers · 2026-06-07, Open roles (3), Shoreditch London HQ, 'research lab in chapters' model, founders named
www.gr.inc/research · 2026-06-07, Team members listed, KellyBench benchmark
www.gr.inc/releases/introducing-openreward · 2026-06-07, OpenReward + ORS open-source spec, deployment models, NVIDIA/Nebius/Eigent partners
www.gr.inc/releases/introducing-kellybench · 2026-06-07, KellyBench long-horizon sequential decision-making benchmark
uk.linkedin.com/company/general-reasoning · 2026-06-07, Public snippet - ~10-11 headcount, 11-50 band, founded 2025, London, long-horizon RL focus
www.formds.com/issuers/general-reasoning-inc · 2026-06-07, SEC Form D - $10,904,992 equity, filed 2025-07-11, directors: Grady, Parker, Taylor, Wang
github.com/GeneralReasoning · 2026-06-07, GitHub org - environment repos, firehorse harness Apache-2.0
www.linkedin.com/in/rosstaylor90/ · 2026-06-07, Ross Taylor co-founder/CEO, ex-Meta AI
www.interconnects.ai/p/interviewing-ross-taylor-on-llm-reasoning · 2026-06-07, Background: Ross Taylor led Galactica, Llama reasoning, Papers with Code
huggingface.co/GeneralReasoning · 2026-06-07, Hugging Face org for General Reasoning datasets/models

Last updated 2026-06-07 · Every quantitative field carries a source and a confidence tag. Fields we could not source publicly are marked unknown, never estimated. See the methodology.