Week 7 Papers — AI for Data, Code & Computation

6 papers downloaded across the literature on LLMs as data analysts, the data-leakage crisis, visualization, and the safety profile of agentic AI for science.

Each entry links to the canonical version of the paper — on arXiv, the journal, or the publisher. Where a paper is paywalled, the DOI is given for UCT-library access.

7.1 · Natural Language to Code

Is GPT-4 a Good Data Analyst?

Cheng, L., Li, X., & Bing, L. (2023)

View source · arXiv:2305.15038

Data Interpreter: An LLM Agent for Data Science

Hong, S., Lin, Y., Liu, B., et al. (2024)

View source · arXiv:2402.18679

7.2 · AI-Assisted Data Analysis in Practice

Leakage and the Reproducibility Crisis in ML-based Science

Kapoor, S., & Narayanan, A. (2023) — Patterns 4(9): 100804

View source · arXiv:2207.07048

7.3 · Visualization with AI

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Dibia, V. (2023) — ACL System Demos

View source · arXiv:2303.02927

7.4 · Verification of AI-Generated Code

Re-uses Kapoor & Narayanan (above) as the central reading.

7.5 · Building Your Data Analysis Workflow

References Mineault (2026) Claude Code for Scientists — a Substack post, linked in the lesson.

7.6 · Agentic Data Analysis

Agentic AI for Scientific Discovery: A Survey of Progress, Challenges, and Future Directions

Gridach, M., Nanavati, J., Abidine, K. Z. E., et al. (2025)

View source · arXiv:2503.08979

Risks of AI scientists: prioritizing safeguarding over autonomy

Zardiashvili, L., et al. (2025) — Nature Communications

View source · DOI:10.1038/s41467-025-63913-1

7.7 · Hands-On Activities and Assessment

Assessment design.

Other Week 7 references are practitioner resources rather than papers:

Mineault, P. (2026). Claude Code for Scientists. neuroai.science
Wickham, H., Çetinkaya-Rundel, M., & Grolemund, G. (2023). R for Data Science (2e). r4ds.hadley.nz
Wilke, C. (2019). Fundamentals of Data Visualization. clauswilke.com/dataviz

Linked but not redistributed

Nature (2026). AI scientists are changing research — institutions, funders and publishers must respond. DOI:10.1038/d41586-026-00934-w 7.6

Nature editorial — free to read on Nature.com but no redistributable PDF endpoint.