Blog

Statistical Computing Series: Mealtime

The Statistical Computing Series hosted by Vanderbilt University Medical Center's Department of Biostatistics features presentations on the implementation of statistical models and methods, statistical computation, and graphics. These informal meetings allow experienced statisticians and developers to share their expertise on computing topics with practitioners across Vanderbilt. On Monday, November 25, 2024, at 1:30pm on Teams, principal biostatistician Josh DeClercq will present "Mealtime: A Shiny app for meal planning." Here is his description of the talk:

 

Have you ever come home from the grocery store and realize you have no idea what to cook? Do you have stacks of cookbooks that largely go unused? Mealtime is a data-driven approach to meal planning. This presentation will touch on how to deconstruct a recipe into its essential components to derive its "makeability" score. I'll explain how I scale this process to accommodate large numbers of recipes and ingredients and discuss how I leverage data to accommodate dietary restrictions. Finally, I will demonstrate how it all comes together in a Shiny application, sharing some tips and challenges I encountered along the way.

 

For access to this webinar, contact series organizer Ryan Moore.

Principal biostatistician Josh DeClercq

Vanderbilt Biostatistics at ENAR 2025 - Invited Preliminary Program

ENAR 2025 has published a preliminary lineup of invited sessions - we congratulate the department members and alumni whose proposals have been accepted! They include:

Assistant professor Gustavo Amorim - speaker, Methodological Considerations for the Design and Analysis of Observational Studies Reliant on Electronic Health Records Data

Professor Benjamin French - organizer/chair, Modern Statistical Challenges of Electronic Health Records Data

PhD candidate Yeji Ko - speaker, Modern Statistical Challenges of Electronic Health Records Data

Alumna Lucy McGowan (PhD 2018) - speaker, Missing Data and Multiple Imputation and Their Applications

PhD student Ashley Mullan - organizer, Collaboration 101: What a Scientist Seeks in a Statistician vs. What a Statistician Seeks in a Scientist

Professor Bryan Shepherd - speaker, Precision in EHR Data: Overcoming Challenges of Measurement Error in Health Outcomes

PhD candidate Jiangmei (Ruby) Xiong - organizer, Collaboration 101: What a Scientist Seeks in a Statistician vs. What a Statistician Seeks in a Scientist

The conference will take place in New Orleans from March 23 through March 26, 2025.

Asthma drug does not speed COVID-19 recovery: study

Dr. Sean Collins: the success of ACTIV-6 “is a testament to the extensive amount of work and expertise behind the scenes by the Vanderbilt Coordinating Center and the Department of Biostatistics at VUMC.”

All-department team publishes paper in Computational and Structural Biotechnology Journal

Congratulations to former visiting student Chia-Jung "Charlene" Chang (PhD candidate in biomedical engineering at National Cheng Kung University), research assistant professor Chih-Yuan Hsu, and professors Qi Liu and Yu Shyr on the publication of "VICTOR: Validation and inspection of cell type annotation through optimal regression." The article on this new method appeared online ahead of print on October 15 and will go to press as part of Computational and Structural Biotechnology Journal's December issue.  As described in the paper's abstract:

Single-cell RNA sequencing provides unprecedent opportunities to explore the heterogeneity and dynamics inherent in cellular biology. An essential step in the data analysis involves the automatic annotation of cells. Despite development of numerous tools for automated cell annotation, assessing the reliability of predicted annotations remains challenging, particularly for rare and unknown cell types. Here, we introduce VICTOR: Validation and inspection of cell type annotation through optimal regression. VICTOR aims to gauge the confidence of cell annotations by an elastic-net regularized regression with optimal thresholds. We demonstrated that VICTOR performed well in identifying inaccurate annotations, surpassing existing methods in diagnostic ability across various single-cell datasets, including within-platform, cross-platform, cross-studies, and cross-omics settings.

Figure 1 in the paper provides an " One example in diagnosing the reliability of cell annotations. a) diagnostic performance of singleR, scmap, SCINA, scPred, CHETAH, scClassify, and Seurat. b) diagnostic performance of VICTOR when applied to annotations from singleR, scmap, SCINA, scPred, CHETAH, scClassify, and Seurat.

 

Chia-Jung Chang

Vanderbilt Biostatistics at AMIA 2024

The 2024 AMIA (American Medical Informatics Association) Annual Symposium will take place in San Francisco from November 9 through November 13. Department members with work to be presented at the symposium include:

Saturday, November 9

Workshop 17, "REDCap on FHIR: Implementing and Using Clinical Data Interoperability Services" - professor Paul Harris, co-instructor/author

 

Sunday, November 10

Workshop 27, "Advancing Biomedical Research Using Multi-omics Data in the All of Us Researcher Workbench,"  8:30 am - co-authored by Paul Harris

Session 7, "Pediatric Health Informatics - Kid Coders,"  3:30 pm

"Revealing Patterns of Child Maltreatment Policy Differences and Demographic Dynamics using BERT-Networks and Clustering Approach" - co-authored by associate professor Rameela Raman 

 

Monday, November 11

Session 17, "LIEAF: Artificial Intelligence and Data Science in Health Informatics Education," 8:30 am

Enhancing Causes of Death Prediction from Electronic Health Records through Multi-Modal Integration of Structured and Unstructured EHR Data - co-authored by professor Michael Matheny

Session 22, "AI Fairness and Ethics - Justice League," 8:30 am

  • "Fairness of AI Collaboration and Suppression in Emergency Triage" - co-authored by professor Bradley Malin
  • "Enhancement of Fairness in AI for Chest X-ray Classification" - co-authored by Bradley Malin

 

Session 53, "Utilization Data and Data Utilization - Auditory Audits, Listening to the Data," 3:30 pm

"Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs" - co-authored by Bradley Malin

Session 54, "Patient Generated Data - Organic Certified," 3:30 pm

"Examining Oral Anti-Cancer Medication Continuation Using Questionnaires, Prescription Refills, and Structured Electronic Health Records" - co-authored by professor Qingxia Chen, Bradley Malin, and alumnus Zhijun Yin (MS 2017)

Poster session 1, 5:00 pm

P114: "Machine Learning Methods for Estimating Gestational Age at Birth from Electronic Health Records" - co-authored by professor Leena Choi

P118: "Large Language Models Enhance the Identification of Emergency Department Visits for Symptomatic Kidney Stones" - co-authored by PhD candidate Siwei Zhang and assistant professor Yaomin Xu

P171: "Comparing EHR-recorded Race/Ethnicity to Self-reported Race/Ethnicity: Insights from the All of Us Research Program" - coauthored by Xiaoke (Sarah) Feng (first author), biostatistician Andrew Guide, assistant in biostatistics Shawn Garbett, and Qingxia Chen

 

Tuesday, November 12

Session 98, "Wearable Sensor Data - Data on the Go," 3:30 pm

"'I worry we’ll blow right by it': Barriers to Uptake of the STRATIFY CDSS for ED Discharge in Acute Heart Failure" - co-authored by associate professor Dandan Liu

 

Poster session 2, 5:00 pm

P05: "Utilizing Large Language Models (LLM) to Optimize Domain-Specific Natural Language Processing (NLP) for Identifying Patients with No Reason for Not Prescribing ACEI/ARB in Chronic Kidney Disease (CKD) Management" - co-authored by Michael Matheny

P27: "Assessing ChatGPT Responses to Alzheimer’s Disease Myths" - co-authored by Bradley Malin and Zhijun Yin

P117: "Algorithmic Matching of Unique Device Information to Electronic Health Record Data" - co-authored by Michael Matheny

P178: "A Study of Challenges In Algorithmic Transportability Between VHA Sites" - co-authored by Michael Matheny

P188: "Real-Time Automated Billing for Tobacco Treatment: A CDS Hook Approach for Simulating Clinician Facing Coding Prompts Within EHRs" - co-authored by Michael Matheny

 

Wednesday, November 13

Session 102, "Self-Service Software Tools for Clinical and Translational Research: Rationale, Benefits, Limitations, Challenges, and the Future," 8:00 am - Paul Harris, speaker

 

Updated 11.11.2024 to include P01.

Vanderbilt Biostatistics at WSDS 2024

The 2024 Women in Statistics and Data Science Conference is underway in Reston, Virginia, from October 16 through 18. We are proud of the department members and alumni involved with this year's meeting. They include:

MS student Zongyue Teng

 

Sarah Lotspeich (PhD 2021)

 

PhD student Ashley Mullan

 

Lead biostatistician Amy Perkins

 

Lucy D'Agostino McGowan (PhD 2018):

 

Statistical Computing Series: Intro to GitHub

The Department of Biostatistics' Statistical Computing Series focuses on the implementation of statistical models and methods, statistical computation and graphics. These informal meetings allow experienced statisticians and developers to share their expertise on computing topics with practitioners across Vanderbilt. On Thursday, October 31, at 1:00 pm, application developer Savannah Obregon will present "Introduction to GitHub," on Microsoft Teams:

Use GitHub for seamless collaboration and robust version control in your projects. This presentation will guide you through the essential features and best practices for using GitHub in a team setting. Learn how to manage repositories, branches, pull requests, and issues to streamline your workflow.

For an example of what's possible on GitHub, see Obregon's own website: https://smobregon.github.io. She was recently named winner of the department's IT Innovation Award and has delivered presentations at conferences such as R/Medicine. 

To obtain a link to this webinar, contact series organizer Ryan Moore.