rl-list.com
UPDATED 2026.06.07
rl-list.com · Vendors · General Reasoning

General Reasoning

Open source
medium confidence
www.gr.inc ↗ · Status: active confirmed · Founded 2025

General Reasoning is an AI research lab (operating research hub in London; legal entity General Reasoning, Inc. registered in the US) building open RL environments and infrastructure for training and evaluating agents over long horizons. Its OpenReward platform and Open Reward Standard (ORS) provide an open specification for connecting language models to community-built RL environments, with 330+ environments accessible through one API.

Key facts
Headquarters
London, United Kingdom (Shoreditch), operating research hub; legal entity General Reasoning, Inc. registered in San Francisco/US per SEC Form Dconfirmed cite
Headcount band
11-50reported cite
Total raised
$10.9Mreported cite
Last round
Equity offering, $10,904,992 sold, filed 2025-07-11 (SEC Form D; stage not labeled, reported as ~2025 seed-stage by Crunchbase)reported cite
SOC 2
unknown
What they sell
environmentsconfirmed cite
Open source
yesconfirmed cite
Deployment
managed-hosted / self-hosted / APIconfirmed cite

Scale & velocity

Current headcount
~10 (LinkedIn public snippet; '11-50' band)reported cite
Headcount growth
unknown
Open roles
3confirmed cite
Other locations
unknown cite
Distributed / remote
noestimated cite

Research depth

Has researchers
yesconfirmed cite
Researcher count
~6 named individuals on site (founders + research/eng staff); counted from research/team pageestimated cite
Backgrounds
Ross Taylor (co-founder/CEO) - ex-Meta AI/FAIR, research lead on Galactica, led reasoning for Llama 2/Llama 3; co-founded Papers with Code (acquired by Meta), Founding team previously led open language model development at Meta (Galactica, Llama 2, Llama 3), Other named staff/directors: Kip Parker, Chengxi Wang, Thomas Grady, Iliyan Zarov, Henry Coursereported cite
Papers / benchmarks
KellyBench: long-horizon sequential decision-making benchmark - https://www.gr.inc/releases/introducing-kellybench, OpenReward / Open Reward Standard - https://www.gr.inc/releases/introducing-openreward, Galactica, Llama 2, Llama 3 (founders' prior work at Meta)confirmed cite

Capital

Total raised
~$10.9M (amount sold per SEC Form D 2025-07-11; corroborated by Crunchbase)reported cite
Last round
Equity offering, $10,904,992 sold, filed 2025-07-11 (SEC Form D; stage not labeled, reported as ~2025 seed-stage by Crunchbase)reported cite
Investors
unknown
Valuation
unknown
Revenue signals
unknown

Security & compliance

SOC 2
unknown
Other certifications
unknown
Security page
unknown

Product

What they sell
environmentsconfirmed cite
Open source
yesconfirmed cite
License
Apache-2.0 (firehorse harness repo); Open Reward Standard described as open-source, specific license not statedreported cite
Deployment model
managed-hosted / self-hosted / APIconfirmed cite
Maturity
GAestimated cite
Notable customers
NVIDIA self-claimed Nebius self-claimed Eigent self-claimed OpenAI self-claimed Meta self-claimed cite

Buyer analysis

Best fit: Teams wanting open, portable RL environments and long-horizon agent benchmarks they can run anywhere (self-hosted) or via managed/API hosting.

How we verified this

Confirmed this is the correct entity matching the 'research grade, open-leaning' note: Ross Taylor's General Reasoning / gr.inc, building open RL environments (OpenReward, Open Reward Standard), distinct from the same-named 'General Analysis' (a separate $10M-seed agentic-security startup) and 'General Intuition' (a $133.7M gaming-RL lab), neither of which the draft confused. Verified founding year 2025 across LinkedIn, careers page, and SEC Form D. Headcount ~10 / 11-50 band confirmed via LinkedIn snippet, corrected the draft's implication of larger scale is not an issue (already small). Key downgrades: total_raised and last_round moved from 'confirmed' to 'reported' since the SEC Form D reports amount sold in a single offering (not lifetime total) with no stage label; Crunchbase corroborates ~$10.9M (Aug 2025). No investors are disclosed anywhere credible, kept unknown. notable_customers corrected: NVIDIA/Nebius/Eigent (and the previously-omitted OpenAI and Meta) appear on the vendor's own OpenReward page as environment providers/contributors, NOT verified paying customers, so all remain self-claimed; flagged frontier-lab ties for NVIDIA, OpenAI, and Meta. HQ clarified as London operating hub with US-registered legal entity (General Reasoning, Inc., San Francisco). SOC2/certifications/valuation/revenue correctly left unknown. Overall confidence: medium.

Related vendors

Sources

  1. www.gr.inc/ · 2026-06-07, Official homepage - positioning, products, founding team ex-Meta, contact, social links
  2. www.gr.inc/careers · 2026-06-07, Open roles (3), Shoreditch London HQ, 'research lab in chapters' model, founders named
  3. www.gr.inc/research · 2026-06-07, Team members listed, KellyBench benchmark
  4. www.gr.inc/releases/introducing-openreward · 2026-06-07, OpenReward + ORS open-source spec, deployment models, NVIDIA/Nebius/Eigent partners
  5. www.gr.inc/releases/introducing-kellybench · 2026-06-07, KellyBench long-horizon sequential decision-making benchmark
  6. uk.linkedin.com/company/general-reasoning · 2026-06-07, Public snippet - ~10-11 headcount, 11-50 band, founded 2025, London, long-horizon RL focus
  7. www.formds.com/issuers/general-reasoning-inc · 2026-06-07, SEC Form D - $10,904,992 equity, filed 2025-07-11, directors: Grady, Parker, Taylor, Wang
  8. github.com/GeneralReasoning · 2026-06-07, GitHub org - environment repos, firehorse harness Apache-2.0
  9. www.linkedin.com/in/rosstaylor90/ · 2026-06-07, Ross Taylor co-founder/CEO, ex-Meta AI
  10. www.interconnects.ai/p/interviewing-ross-taylor-on-llm-reasoning · 2026-06-07, Background: Ross Taylor led Galactica, Llama reasoning, Papers with Code
  11. huggingface.co/GeneralReasoning · 2026-06-07, Hugging Face org for General Reasoning datasets/models
Last updated 2026-06-07 · Every quantitative field carries a source and a confidence tag. Fields we could not source publicly are marked unknown, never estimated. See the methodology.