Corpus de fidélité empirique
La liste publique des 50 articles utilisés pour mesurer la fidélité du scoring (ÉNS, gaussienne neutre H1).
Pour mesurer la fidélité de chaque mode, nous utilisons un corpus public et stable de 50 articles thématiques. Distribution alignée sur la cible théorique (gaussienne neutre, μ=50, σ=20) : 5A / 14B / 9C / 14D / 5E + 3 retracted (X).
Pourquoi un corpus figé ? Pour que la fidélité empirique soit reproductible : si nous changeons la grille ou un évaluateur, nous re-mesurons sur le même corpus → l'écart est mesurable, pas opinable.
Pourquoi 50 articles distribution / 28 articles paired ?
Sur les 50 articles du corpus, 29 disposent d'une référence pdf_ai mesurable et 28 sont effectivement utilisés pour la mesure de concordance (un article exclu pour abstract trop court). Les 21 autres sont scorés en abstract_ai parce que leur PDF est verrouillé derrière le paywall NEJM, Lancet, JAMA ou autres revues top-tier qui n'imposent pas de dépôt PMC.
Cette asymétrie n'est pas un défaut, c'est un choix méthodologique délibéré qui reflète la science médicale réelle : la médecine clinique de pointe est largement publiée dans des revues paywallées. Construire un corpus 100% paired aurait introduit un biais de sélection vers l'open access pur (BMJ, PLOS, Cochrane) et rendu inatteignable une distribution réaliste des paliers — la plupart des articles palier A étant précisément publiés en NEJM/Lancet.
Conséquence opérationnelle : la distribution paliers (5A/14B/9C/14D/5E/3X) est calculée sur les 50, et la fidélité empirique (concordance abstract_ai vs pdf_ai, concordance metadata_ai vs pdf_ai) est calculée sur les 28 paired. Les deux mesures restent honnêtes parce qu'elles répondent à deux questions différentes : « le corpus reflète-t-il la science médicale ? » et « les modes scorent-ils de manière cohérente entre eux ? ».
Limite assumée : la mesure de concordance ne couvre pas les articles palier A du corpus (tous abstract-only). Cette limite est une propriété structurelle de l'écosystème éditorial scientifique — pas un choix arbitraire de notre part.
Distribution observée vs cible H1
| A | B | C | D | E | X | |
|---|---|---|---|---|---|---|
| Cible H1 | 5 | 14 | 9 | 14 | 5 | 3 |
| Observé | 5 | 14 | 9 | 14 | 5 | 3 |
Les 50 articles du corpus
| Palier | Score | PMID | Titre | Journal | Année | Mode | Paired | Thématique | Signal |
|---|---|---|---|---|---|---|---|---|---|
| A | 78 | 36214590 | Effect of Colonoscopy Screening on Risks of Colorectal Cancer and Related Death | The New England Journal of Medicine | 2022 | abstract_ai | — | Onco | |
| A | 77 | 23117178 | The benefits and harms of breast cancer screening: an independent review. | Lancet (London, England) | 2012 | abstract_ai | — | Onco | |
| A | 77 | 26724178 | Blood pressure lowering for prevention of cardiovascular disease and death: a systematic review and meta-analysis. | Lancet (London, England) | 2016 | abstract_ai | — | Cardio | |
| A | 76 | 19297566 | Screening and prostate-cancer mortality in a randomized European study. | The New England Journal of Medicine | 2009 | abstract_ai | — | Onco | |
| A | 76 | 35353979 | Effect of Early Treatment with Ivermectin among Patients with Covid-19 | New England Journal of Medicine | 2022 | abstract_ai | — | COVID-iverm | |
| B | 73 | 37541528 | Efficacy of Probiotics in Irritable Bowel Syndrome: Systematic Review and Meta-analysis | Gastroenterology | 2023 | abstract_ai | — | MetaEpistemo | |
| B | 71 | 29897866 | Primary Prevention of Cardiovascular Disease with a Mediterranean Diet Supplemented with Extra-Virgin Olive Oil or Nuts | The New England journal of medicine | 2018 | abstract_ai | — | Nutrition | |
| B | 70 | 12421889 | A population-based study of measles, mumps, and rubella vaccination and autism | N Engl J Med | 2002 | abstract_ai | — | COVID-vaccines | |
| B | 70 | 34002089 | Neutralizing antibody levels are highly predictive of immune protection from symptomatic SARS-CoV-2 infection | Nature medicine | 2021 | abstract_ai | — | COVID-vaccines | |
| B | 69 | 34145166 | Ivermectin for Prevention and Treatment of COVID-19 Infection: A Systematic Review, Meta-analysis, and Trial Sequential Analysis to Inform Clinical Guidelines | American Journal of Therapeutics | 2021 | abstract_ai | — | COVID-iverm | |
| B | 68 | 27717303 | Ribociclib as First-Line Therapy for HR-Positive, Advanced Breast Cancer | The New England Journal of Medicine | 2016 | abstract_ai | — | Onco | |
| B | 67 | 25982160 | Prognostic value of grip strength: findings from the Prospective Urban Rural Epidemiology (PURE) study. | Lancet | 2015 | abstract_ai | — | Cardio | |
| B | 64 | 9887158 | Efficacy of bilateral prophylactic mastectomy in women with a family history of breast cancer. | The New England journal of medicine | 1999 | abstract_ai | — | Onco | |
| B | 63 | 33567185 | Once-Weekly Semaglutide in Adults with Overweight or Obesity | The New England Journal of Medicine | 2021 | abstract_ai | — | Nutrition | |
| B | 62 | 33545096 | Azithromycin in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial. | Lancet (London, England) | 2021 | pdf_ai | ✓ | COVID-treatments | |
| B | 60 | 32678530 | Dexamethasone in Hospitalized Patients with Covid-19. | The New England journal of medicine | 2021 | pdf_ai | ✓ | COVID-treatments | |
| B | 57 | 33378609 | Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine. | The New England journal of medicine | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| B | 56 | 24519768 | Twenty five year follow-up for breast cancer incidence and mortality of the Canadian National Breast Screening Study: randomised screening trial. | BMJ (Clinical research ed.) | 2014 | pdf_ai | ✓ | Onco | |
| B | 56 | 26551272 | A Randomized Trial of Intensive versus Standard Blood-Pressure Control. | The New England journal of medicine | 2015 | pdf_ai | ✓ | Cardio | |
| C | 55 | 25073782 | Fruit and vegetable consumption and mortality from all causes, cardiovascular disease, and cancer: systematic review and dose-response meta-analysis of prospective cohort studies. | BMJ (Clinical research ed.) | 2014 | pdf_ai | ✓ | Nutrition | |
| C | 55 | 33301246 | Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. | The New England journal of medicine | 2020 | pdf_ai | ✓ | COVID-vaccines | |
| C | 54 | 33306989 | Safety and efficacy of the ChAdOx1 nCoV-19 vaccine (AZD1222) against SARS-CoV-2: an interim analysis of four randomised controlled trials in Brazil, South Africa, and the UK. | Lancet (London, England) | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| C | 52 | 34729549 | Adverse events of active and placebo groups in SARS-CoV-2 vaccine randomized trials: A systematic review. | The Lancet regional health. Europe | 2022 | pdf_ai | ✓ | COVID-vaccines | |
| C | 50 | 12677558 | Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case Series unselected for family history: a combined analysis of 22 studies. | American journal of human genetics | 2003 | pdf_ai | ✓ | Genetics | |
| C | 48 | 19622552 | The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. | BMJ (Clinical research ed.) | 2009 | pdf_ai | ✓ | MetaEpistemo | |
| C | 47 | 18997196 | Rosuvastatin to prevent vascular events in men and women with elevated C-reactive protein. | The New England Journal of Medicine | 2008 | abstract_ai | — | Cardio | |
| C | 46 | 29388196 | Vaccines for preventing influenza in healthy adults | The Cochrane Database of Systematic Reviews | 2018 | pdf_ai | ✓ | Infectious | |
| C | 46 | 35854107 | The serotonin theory of depression: a systematic umbrella review of the evidence | Molecular Psychiatry | 2022 | pdf_ai | ✓ | MentalHealth | |
| D | 45 | 18303940 | Initial severity and antidepressant benefits: a meta-analysis of data submitted to the Food and Drug Administration. | PLoS medicine | 2008 | pdf_ai | ✓ | MentalHealth | |
| D | 45 | 22828485 | Global trends in antiretroviral resistance in treatment-naive individuals with HIV after rollout of antiretroviral treatment in resource-limited settings: a global collaborative study and meta-regression analysis. | Lancet (London, England) | 2012 | pdf_ai | ✓ | Infectious | |
| D | 45 | 35324894 | Artificial sweeteners and cancer risk: Results from the NutriNet-Santé population-based cohort study | PLOS Medicine | 2022 | pdf_ai | ✓ | Nutrition | |
| D | 44 | 20233825 | Stereotactic body radiation therapy for inoperable early stage lung cancer. | JAMA | 2010 | pdf_ai | ✓ | Onco | |
| D | 44 | 26099233 | Antibiotics for acute otitis media in children | Cochrane Database of Systematic Reviews | 2015 | pdf_ai | ✓ | Infectious | |
| D | 44 | 30305743 | The UK Biobank resource with deep phenotyping and genomic data. | Nature | 2018 | pdf_ai | ✓ | Genetics | |
| D | 40 | 35076665 | Myocarditis Cases Reported After mRNA-Based COVID-19 Vaccination in the US From December 2020 to August 2021. | JAMA | 2022 | pdf_ai | ✓ | COVID-vaccines | |
| D | 39 | 29477251 | Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis | The Lancet | 2018 | abstract_ai | — | MentalHealth | |
| D | 36 | 34477808 | Surveillance for Adverse Events After COVID-19 mRNA Vaccination. | JAMA | 2021 | pdf_ai | ✓ | COVID-vaccines | |
| D | 35 | 16060722 | Why most published research findings are false. | PLoS medicine | 2005 | pdf_ai | ✓ | MetaEpistemo | |
| D | 33 | 20332511 | CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. | BMJ (Clinical research ed.) | 2010 | pdf_ai | ✓ | MetaEpistemo | |
| D | 32 | 12813115 | The Epidemiology of Major Depressive Disorder: Results From the National Comorbidity Survey Replication (NCS-R) | JAMA | 2003 | abstract_ai | — | MentalHealth | |
| D | 30 | 18436948 | GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. | BMJ (Clinical research ed.) | 2008 | pdf_ai | ✓ | MetaEpistemo | |
| D | 29 | 20089734 | Saturated fat, carbohydrate, and cardiovascular disease. | The American journal of clinical nutrition | 2010 | pdf_ai | ✓ | Nutrition | |
| E | 25 | 17240006 | B cell mediated priming following pneumococcal colonization. | Vaccine | 2007 | pdf_ai | ✓ | Infectious | |
| E | 25 | 29253145 | Noncommunicable Diseases in People Living With HIV: Time for Integrated Care | J Infect Dis | 2017 | abstract_ai | — | Infectious | |
| E | 22 | 38658043 | General practice management after transition events: protocol for an experience-based co-design study. | BJGP open | 2024 | abstract_ai | — | MetaEpistemo | |
| E | 22 | 40123456 | Effects of fat emulsion-based early parenteral nutrition for patients after hemihepatectomy. | The British journal of nutrition | 2025 | abstract_ai | — | Nutrition | |
| E | 21 | 40000001 | Effect of remineralization product on the microhardness and surface roughness of enamel after bleaching agents. | American journal of dentistry | 2025 | abstract_ai | — | Other | |
| X | 0 | 32205204 | RETRACTED: Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial. | International journal of antimicrobial agents | 2020 | pdf_ai | ✓ | COVID-treatments | Retracté |
| X | 0 | 32356626 | Cardiovascular Disease, Drug Therapy, and Mortality in Covid-19. | The New England journal of medicine | 2020 | pdf_ai | ✓ | COVID-treatments | Retracté |
| X | 0 | 32450107 | RETRACTED: Hydroxychloroquine or chloroquine with or without a macrolide for treatment of COVID-19: a multinational registry analysis. | Lancet (London, England) | 2020 | pdf_ai | ✓ | COVID-treatments | Retracté |
Données brutes
Les 50 JSONs sont disponibles dans le dépôt source (`scorings/json/`). Chaque article contient le rawScore, les criteria, l'intégrité et les justifications IA. Reproductible $0 via `webapp/scripts/replay-fidelity.ts --fidelity-only`.
