Empirical fidelity corpus
The public list of the 50 articles used to measure scoring fidelity (ÉNS, neutral Gaussian H1).
To measure the fidelity of each scoring mode, we use a public, stable corpus of 50 thematic articles. Its distribution matches the theoretical target (neutral Gaussian, μ=50, σ=20): 5A / 14B / 9C / 14D / 5E + 3 retracted (X).
Why a fixed corpus? So that empirical fidelity is reproducible: if we change the scoring grid or an evaluator, we measure again on the same corpus, so the gap is quantifiable rather than a matter of opinion.
Why 50 articles for the distribution but 28 paired articles?
Of the 50 articles in the corpus, 29 have a measurable pdf_ai reference and 28 are actually used for the concordance measurement (one article is excluded because its abstract is too short). The other 21 are scored in abstract_ai mode because their PDF is locked behind the paywall of NEJM, Lancet, JAMA, or other top-tier journals that do not require deposit in PMC.
This asymmetry is not a defect. It is a deliberate methodological choice that reflects real medical science: cutting-edge clinical medicine is published mostly in paywalled journals. Building a 100% paired corpus would have introduced a selection bias toward pure open access (BMJ, PLOS, Cochrane) and would have made a realistic level distribution unattainable, since most level A articles are published precisely in NEJM/Lancet.
Operational consequence: the level distribution (5A/14B/9C/14D/5E/3X) is computed over all 50 articles, and the empirical fidelity (abstract_ai vs pdf_ai concordance, metadata_ai vs pdf_ai concordance) is computed over the 28 paired articles. Both measures remain honest because they answer two different questions: "does the corpus reflect medical science?" and "do the modes score consistently with one another?".
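The concordance computation over the paired subset can be sketched as follows. This is a minimal illustration, not the project's actual metric: this page does not define "concordance", so the sketch assumes two simple indicators (the share of articles with the same level, and the mean absolute score gap). The `PairedScore` type, the `concordance` function, and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PairedScore:
    """One paired article: a candidate mode compared with the pdf_ai reference."""
    article_id: str
    candidate_level: str    # level from abstract_ai or metadata_ai (assumed field)
    reference_level: str    # level from pdf_ai (assumed field)
    candidate_score: float
    reference_score: float


def concordance(pairs):
    """Return level agreement and mean absolute score gap over paired articles.

    Assumption: these two indicators stand in for the concordance measure,
    which this page does not spell out.
    """
    if not pairs:
        raise ValueError("no paired articles to compare")
    n = len(pairs)
    same_level = sum(p.candidate_level == p.reference_level for p in pairs)
    total_gap = sum(abs(p.candidate_score - p.reference_score) for p in pairs)
    return {
        "n": n,
        "level_agreement": same_level / n,
        "mean_abs_gap": total_gap / n,
    }


# Made-up placeholder values for illustration only, not corpus measurements.
example = [
    PairedScore("example-1", "B", "B", 61.0, 62.0),
    PairedScore("example-2", "C", "D", 47.0, 44.0),
    PairedScore("example-3", "D", "D", 38.0, 40.0),
    PairedScore("example-4", "C", "C", 52.0, 50.0),
]
result = concordance(example)
print(result)
```

In this sketch only articles that have a pdf_ai reference would be passed in, which is why the measurement is restricted to the paired subset while the level distribution uses the full corpus.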
Acknowledged limitation: the concordance measurement does not cover the level A articles of the corpus (all abstract-only). This limitation is a structural property of the scientific publishing ecosystem, not an arbitrary choice on our part.
Observed distribution vs H1 target
| | A | B | C | D | E | X |
|---|---|---|---|---|---|---|
| H1 target | 5 | 14 | 9 | 14 | 5 | 3 |
| Observed | 5 | 14 | 9 | 14 | 5 | 3 |
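The comparison in this table can be reproduced with a small tally. The sketch below is illustrative: `TARGET_H1` copies the target row above, while `distribution_gap` is a hypothetical helper name and not part of the repository.

```python
from collections import Counter

# Target counts per level, copied from the H1 target row above.
TARGET_H1 = {"A": 5, "B": 14, "C": 9, "D": 14, "E": 5, "X": 3}


def distribution_gap(levels, target=TARGET_H1):
    """Return observed minus target count for every level seen or expected."""
    observed = Counter(levels)
    keys = sorted(set(target) | set(observed))
    return {key: observed.get(key, 0) - target.get(key, 0) for key in keys}


# Levels of the 50 corpus articles, expanded from the observed row above.
observed_levels = (
    ["A"] * 5 + ["B"] * 14 + ["C"] * 9 + ["D"] * 14 + ["E"] * 5 + ["X"] * 3
)
gap = distribution_gap(observed_levels)
print(gap)  # every value is 0 when the observed counts match the target
```

A non-zero entry would indicate which level is over-represented (positive) or under-represented (negative) after a change to the grid or to an evaluator.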
The 50 articles of the corpus
| Level | Score | PMID | Title | Journal | Year | Mode | Paired | Topic | Signal |
|---|---|---|---|---|---|---|---|---|---|
| A | 78 | 36214590 | Effect of Colonoscopy Screening on Risks of Colorectal Cancer and Related Death | The New England Journal of Medicine | 2022 | abstract_ai | — | Onco | |
| A | 77 | 23117178 | The benefits and harms of breast cancer screening: an independent review. | Lancet (London, England) | 2012 | abstract_ai | — | Onco | |
| A | 77 | 26724178 | Blood pressure lowering for prevention of cardiovascular disease and death: a systematic review and meta-analysis. | Lancet (London, England) | 2016 | abstract_ai | — | Cardio | |
| A | 76 | 19297566 | Screening and prostate-cancer mortality in a randomized European study. | The New England Journal of Medicine | 2009 | abstract_ai | — | Onco | |
| A | 76 | 35353979 | Effect of Early Treatment with Ivermectin among Patients with Covid-19 | New England Journal of Medicine | 2022 | abstract_ai | — | COVID-iverm | |
| B | 73 | 37541528 | Efficacy of Probiotics in Irritable Bowel Syndrome: Systematic Review and Meta-analysis | Gastroenterology | 2023 | abstract_ai | — | MetaEpistemo | |
| B | 71 | 29897866 | Primary Prevention of Cardiovascular Disease with a Mediterranean Diet Supplemented with Extra-Virgin Olive Oil or Nuts | The New England journal of medicine | 2018 | abstract_ai | — | Nutrition | |
| B | 70 | 12421889 | A population-based study of measles, mumps, and rubella vaccination and autism | N Engl J Med | 2002 | abstract_ai | — | COVID-vaccines | |
| B | 70 | 34002089 | Neutralizing antibody levels are highly predictive of immune protection from symptomatic SARS-CoV-2 infection | Nature medicine | 2021 | abstract_ai | — | COVID-vaccines | |
| B | 69 | 34145166 | Ivermectin for Prevention and Treatment of COVID-19 Infection: A Systematic Review, Meta-analysis, and Trial Sequential Analysis to Inform Clinical Guidelines | American Journal of Therapeutics | 2021 | abstract_ai | — | COVID-iverm | |
| B | 68 | 27717303 | Ribociclib as First-Line Therapy for HR-Positive, Advanced Breast Cancer | The New England Journal of Medicine | 2016 | abstract_ai | — | Onco | |
| B | 67 | 25982160 | Prognostic value of grip strength: findings from the Prospective Urban Rural Epidemiology (PURE) study. | Lancet | 2015 | abstract_ai | — | Cardio | |
| B | 64 | 9887158 | Efficacy of bilateral prophylactic mastectomy in women with a family history of breast cancer. | The New England journal of medicine | 1999 | abstract_ai | — | Onco | |
| B | 63 | 33567185 | Once-Weekly Semaglutide in Adults with Overweight or Obesity | The New England Journal of Medicine | 2021 | abstract_ai | — | Nutrition | |
| B | 62 | 33545096 | Azithromycin in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial. | Lancet (London, England) | 2021 | pdf_ai | ✓ | COVID-treatments | |
| B | 60 | 32678530 | Dexamethasone in Hospitalized Patients with Covid-19. | The New England journal of medicine | 2021 | pdf_ai | ✓ | COVID-treatments | |
| B | 57 | 33378609 | Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine. | The New England journal of medicine | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| B | 56 | 24519768 | Twenty five year follow-up for breast cancer incidence and mortality of the Canadian National Breast Screening Study: randomised screening trial. | BMJ (Clinical research ed.) | 2014 | pdf_ai | ✓ | Onco | |
| B | 56 | 26551272 | A Randomized Trial of Intensive versus Standard Blood-Pressure Control. | The New England journal of medicine | 2015 | pdf_ai | ✓ | Cardio | |
| C | 55 | 25073782 | Fruit and vegetable consumption and mortality from all causes, cardiovascular disease, and cancer: systematic review and dose-response meta-analysis of prospective cohort studies. | BMJ (Clinical research ed.) | 2014 | pdf_ai | ✓ | Nutrition | |
| C | 55 | 33301246 | Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. | The New England journal of medicine | 2020 | pdf_ai | ✓ | COVID-vaccines | |
| C | 54 | 33306989 | Safety and efficacy of the ChAdOx1 nCoV-19 vaccine (AZD1222) against SARS-CoV-2: an interim analysis of four randomised controlled trials in Brazil, South Africa, and the UK. | Lancet (London, England) | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| C | 52 | 34729549 | Adverse events of active and placebo groups in SARS-CoV-2 vaccine randomized trials: A systematic review. | The Lancet regional health. Europe | 2022 | pdf_ai | ✓ | COVID-vaccines | |
| C | 50 | 12677558 | Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case Series unselected for family history: a combined analysis of 22 studies. | American journal of human genetics | 2003 | pdf_ai | ✓ | Genetics | |
| C | 48 | 19622552 | The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. | BMJ (Clinical research ed.) | 2009 | pdf_ai | ✓ | MetaEpistemo | |
| C | 47 | 18997196 | Rosuvastatin to prevent vascular events in men and women with elevated C-reactive protein. | The New England Journal of Medicine | 2008 | abstract_ai | — | Cardio | |
| C | 46 | 29388196 | Vaccines for preventing influenza in healthy adults | The Cochrane Database of Systematic Reviews | 2018 | pdf_ai | ✓ | Infectious | |
| C | 46 | 35854107 | The serotonin theory of depression: a systematic umbrella review of the evidence | Molecular Psychiatry | 2022 | pdf_ai | ✓ | MentalHealth | |
| D | 45 | 18303940 | Initial severity and antidepressant benefits: a meta-analysis of data submitted to the Food and Drug Administration. | PLoS medicine | 2008 | pdf_ai | ✓ | MentalHealth | |
| D | 45 | 22828485 | Global trends in antiretroviral resistance in treatment-naive individuals with HIV after rollout of antiretroviral treatment in resource-limited settings: a global collaborative study and meta-regression analysis. | Lancet (London, England) | 2012 | pdf_ai | ✓ | Infectious | |
| D | 45 | 35324894 | Artificial sweeteners and cancer risk: Results from the NutriNet-Santé population-based cohort study | PLOS Medicine | 2022 | pdf_ai | ✓ | Nutrition | |
| D | 44 | 20233825 | Stereotactic body radiation therapy for inoperable early stage lung cancer. | JAMA | 2010 | pdf_ai | ✓ | Onco | |
| D | 44 | 26099233 | Antibiotics for acute otitis media in children | Cochrane Database of Systematic Reviews | 2015 | pdf_ai | ✓ | Infectious | |
| D | 44 | 30305743 | The UK Biobank resource with deep phenotyping and genomic data. | Nature | 2018 | pdf_ai | ✓ | Genetics | |
| D | 40 | 35076665 | Myocarditis Cases Reported After mRNA-Based COVID-19 Vaccination in the US From December 2020 to August 2021. | JAMA | 2022 | pdf_ai | ✓ | COVID-vaccines | |
| D | 39 | 29477251 | Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis | The Lancet | 2018 | abstract_ai | — | MentalHealth | |
| D | 36 | 34477808 | Surveillance for Adverse Events After COVID-19 mRNA Vaccination. | JAMA | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| D | 35 | 16060722 | Why most published research findings are false. | PLoS medicine | 2005 | pdf_ai | ✓ | MetaEpistemo | |
| D | 33 | 20332511 | CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. | BMJ (Clinical research ed.) | 2010 | pdf_ai | ✓ | MetaEpistemo | |
| D | 32 | 12813115 | The Epidemiology of Major Depressive Disorder: Results From the National Comorbidity Survey Replication (NCS-R) | JAMA | 2003 | abstract_ai | — | MentalHealth | |
| D | 30 | 18436948 | GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. | BMJ (Clinical research ed.) | 2008 | pdf_ai | ✓ | MetaEpistemo | |
| D | 29 | 20089734 | Saturated fat, carbohydrate, and cardiovascular disease. | The American journal of clinical nutrition | 2010 | pdf_ai | ✓ | Nutrition | |
| E | 25 | 17240006 | B cell mediated priming following pneumococcal colonization. | Vaccine | 2007 | pdf_ai | ✓ | Infectious | |
| E | 25 | 29253145 | Noncommunicable Diseases in People Living With HIV: Time for Integrated Care | J Infect Dis | 2017 | abstract_ai | — | Infectious | |
| E | 22 | 38658043 | General practice management after transition events: protocol for an experience-based co-design study. | BJGP open | 2024 | abstract_ai | — | MetaEpistemo | |
| E | 22 | 40123456 | Effects of fat emulsion-based early parenteral nutrition for patients after hemihepatectomy. | The British journal of nutrition | 2025 | abstract_ai | — | Nutrition | |
| E | 21 | 40000001 | Effect of remineralization product on the microhardness and surface roughness of enamel after bleaching agents. | American journal of dentistry | 2025 | abstract_ai | — | Other | |
| X | 0 | 32205204 | RETRACTED: Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial. | International journal of antimicrobial agents | 2020 | pdf_ai | ✓ | COVID-treatments | Retracted |
| X | 0 | 32356626 | Cardiovascular Disease, Drug Therapy, and Mortality in Covid-19. | The New England journal of medicine | 2020 | pdf_ai | ✓ | COVID-treatments | Retracted |
| X | 0 | 32450107 | RETRACTED: Hydroxychloroquine or chloroquine with or without a macrolide for treatment of COVID-19: a multinational registry analysis. | Lancet (London, England) | 2020 | pdf_ai | ✓ | COVID-treatments | Retracted |
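As a rough sanity check, the scores in the table above can be summarized and set against the μ=50, σ=20 target. The sketch below copies the 47 scores of levels A to E from the table. Excluding the three retracted articles (score 0) and using the population standard deviation are assumptions of this illustration, not choices stated on this page.

```python
from statistics import mean, pstdev

# Scores copied from the table above, grouped by level.
# Assumption: the three retracted articles (level X, score 0) are left out.
SCORES_BY_LEVEL = {
    "A": [78, 77, 77, 76, 76],
    "B": [73, 71, 70, 70, 69, 68, 67, 64, 63, 62, 60, 57, 56, 56],
    "C": [55, 55, 54, 52, 50, 48, 47, 46, 46],
    "D": [45, 45, 45, 44, 44, 44, 40, 39, 36, 35, 33, 32, 30, 29],
    "E": [25, 25, 22, 22, 21],
}

scores = [score for group in SCORES_BY_LEVEL.values() for score in group]
summary = {
    "n": len(scores),
    "mean": mean(scores),
    "sd": pstdev(scores),  # population standard deviation (assumed convention)
}
print(summary)
```

The 47 scores sum to 2399, giving a mean of about 51.0, close to the target μ of 50. The spread is roughly 17, narrower than the target σ of 20.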
Raw data
The 50 JSON files are available in the source repository (`scorings/json/`). Each article's file contains the rawScore, the criteria, the integrity data, and the AI justifications. The measurement can be reproduced at $0 cost with `webapp/scripts/replay-fidelity.ts --fidelity-only`.
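For readers who want to inspect the raw files directly, a minimal loading sketch might look like this. It assumes one JSON file per article in `scorings/json/` with `rawScore` as a top-level key. The actual schema is not described on this page, and `load_raw_scores` is a hypothetical helper, so adjust the key lookup to the real file layout.

```python
import json
from pathlib import Path


def load_raw_scores(directory):
    """Map each JSON file name (without extension) to its rawScore value.

    Assumption: each file holds one JSON object with a top-level "rawScore".
    Files without that key map to None rather than raising an error.
    """
    scores = {}
    for path in sorted(Path(directory).glob("*.json")):
        with path.open(encoding="utf-8") as handle:
            record = json.load(handle)
        scores[path.stem] = record.get("rawScore")
    return scores


if __name__ == "__main__":
    # Path taken from the text above, relative to the repository root.
    for name, raw_score in load_raw_scores("scorings/json").items():
        print(name, raw_score)
```

This only reads the stored results. It does not replace the replay script, which remains the reference way to reproduce the fidelity measurement.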
