Timing: Assigned in Class 12, due before Class 13.
Prerequisites: You have a REMARK-in-progress for your course paper — the repository structure from Paper topic choice and REMARK setup plus whatever you have built since. You have completed Ask an AI to assess REMARK compliance (baseline tier). This assignment is a focused follow-up that targets substantive improvements rather than checklist compliance.
Use a strong AI model (Claude Opus 4.7 recommended) as a critical reviewer of your REMARK repository. Have it read your code, configuration, and documentation against the standards at econ-ark/REMARK, and produce a prioritized list of substantive improvements — changes to reproducibility, model fidelity, documentation completeness, or code quality. Not cosmetic changes.
Then pick the top 3–5 and actually make them, as a branch + PR against your own REMARK repo’s upstream.
Critical review of a codebase — finding places where reproduce.sh will silently fail, spotting a subtle departure from the paper’s model, identifying the parts of the documentation a new reader cannot follow — is exactly the kind of work that rewards a strong model. Use Claude Opus 4.7 in Cursor or Claude Code, with the agent in read-and-analyze mode (not auto-edit mode) so you see every suggestion before it lands. If Opus 4.7 is unavailable, Sonnet 4.6 is an acceptable substitute; avoid Haiku for this task.
Please review my REMARK-in-progress at
<YOUR REPO URL>against the baseline-REMARK standards at
https://github.com/econ-ark/REMARK/blob/main/STANDARD.mdhttps://github.com/econ-ark/REMARK/blob/main/How-To-Make-A-REMARK.mdhttps://github.com/econ-ark/REMARK/blob/main/README.mdIgnore the published-tier requirements (Zenodo DOI etc.). Focus on the standard / baseline tier.
I want a prioritized list of substantive improvements — not cosmetic ones — categorized by:
Reproducibility. Can
reproduce.shrun end-to-end from a clean clone? Where is it fragile, silently failing, or reliant on state that is not captured bybinder/environment.yml?Model fidelity. Does the code compute what the paper actually specifies? Are there silent approximations, simplifications, or departures (e.g., a different utility kernel, a different shock process, a different timing convention) that should be either (a) justified and documented, or (b) fixed?
Documentation completeness. Is
CITATION.cffpresent and correct? Is there a top-levelREMARK.md(or equivalent) explaining what the REMARK does, which paper result(s) it reproduces, and how a reader should start?Code quality. Where would an ambitious follow-up student get stuck — magic numbers, undocumented data dependencies, tight coupling to your local filesystem, environment assumptions?
Anything else a thoughtful reviewer would flag before you submit a PR to the REMARK catalog.
Output format: a numbered list of 5–10 proposed improvements. For each give (a) the problem in one sentence, (b) the proposed change, (c) where it goes (file, function, script, line range). Omit cosmetic or formatting items.
Substance improvements from Claude Opus 4.7 review.Submit the PR URL and include in the PR description:
econ-ark/REMARK — standards repo.