Add documentation on preferring binary enums over booleans by cmungall · Pull Request #59 · linkml/valuesets

cmungall · 2026-03-08T14:59:31Z

Summary

This PR adds a comprehensive explanation document to the LinkML schema documentation that advocates for using two-element enumerations instead of boolean types in schema design.

Key Changes

New document: docs/explanations/binary-enums-vs-booleans.md - A detailed guide explaining why binary enums are preferable to booleans, including:
- Five key reasons to prefer enums (extensibility, clarity, avoiding cryptic negations, self-documentation, and the "name the scale" pattern)
- Guidance on when booleans remain acceptable
- Practical LinkML examples showing binary enum patterns
- A comparison table summarizing the trade-offs
- References to the Tidy Design Principles
Updated navigation: docs/index.md - Added a new "Explanations" section with a link to the binary enums guide

Notable Details

The document draws from established design principles (tidyverse's Tidy Design Principles) and provides concrete examples relevant to LinkML users, including ontology mapping with meaning annotations and practical use cases like sort direction and strand orientation.

https://claude.ai/code/session_012NRY1cEkUdQyExXFvJZUmv

Add documentation explaining why binary enums are often preferred over boolean types, drawing from the tidyverse design principles. Covers extensibility, clarity, self-documenting code, and the "name the scale" pattern. Includes LinkML-specific examples. Reference: https://design.tidyverse.org/boolean-strategies.html

claude · 2026-03-08T14:59:43Z

Claude finished @cmungall's task —— View job

Code Review Analysis

I've completed a comprehensive review of this documentation PR. Overall, this is an excellent addition to the LinkML valuesets project that provides clear guidance on schema design principles.

Todo List:

Read repository guidelines (CLAUDE.md)
Review new documentation file
Review index.md changes
Analyze content quality and best practices
Check for technical accuracy
Provide structured feedback

✅ Strengths

High-Quality Technical Content:

The document provides clear, well-reasoned arguments backed by established design principles (tidyverse Tidy Design)
Excellent use of concrete examples that demonstrate abstract concepts
Strong technical writing that balances accessibility with depth

LinkML Integration:

Examples properly follow LinkML syntax and conventions
Appropriate use of meaning annotations for ontology mapping (lines 111, 113, 120, 122)
Aligns well with repository guidelines for enum naming (CamelCase) and permissible values (UPPERCASE)

Practical Value:

The extensibility argument (Section 1) is particularly compelling for schema evolution
The comparison table (lines 154-160) provides an excellent quick reference
Real-world examples make abstract principles concrete

🔍 Areas for Enhancement

Minor Technical Issues:

Ontology Reference Verification (docs/explanations/binary-enums-vs-booleans.md:111-122)
- The document uses ontology IDs (SIO:001395, SO:0000853, etc.) but per CLAUDE.md guidelines: "ALWAYS check IDs, NEVER guess them. Use OLS to check."
- Consider adding a note about ID verification or use placeholder examples
Consistency in Examples (docs/explanations/binary-enums-vs-booleans.md:126-132)
- The PresenceStatus enum introduces a third value (NOT_DETERMINED) which somewhat contradicts the "binary enum" concept
- Consider renaming to "DetectionStatus" or using a purely binary example

Documentation Structure:

The "When Booleans Are Acceptable" section (lines 92-100) could benefit from LinkML examples to match the pattern established elsewhere
Consider adding cross-references to other LinkML documentation

🚀 Suggestions

Add Validation Command Reference:
- Since this affects schema design, mention using just validate after implementing changes (per CLAUDE.md guidelines)
Consider Migration Guidance:
- Brief section on how to migrate existing boolean fields to enums without breaking compatibility
Link to Repository Examples:
- Reference existing enums in the valuesets schemas as real-world examples

🔒 Security & Performance

No security concerns - This is documentation only
No performance impact - Documentation change only
No test coverage needed - Documentation files don't require unit tests

📋 Index.md Changes

The changes to docs/index.md are minimal and appropriate:

Adds new "Explanations" section with proper link
Follows existing documentation structure
No issues detected

🎯 Overall Assessment

This PR significantly enhances the project's documentation quality. The binary enums guidance addresses a common schema design challenge with clear, practical advice. The document is well-researched, technically sound, and aligns with established software design principles.

Recommendation: ✅ Approve with minor suggestions

The minor issues noted above don't detract from the overall value of this contribution. This document will serve as an excellent reference for LinkML schema designers.

Copilot

Pull request overview

Adds a new “Explanations” document to the LinkML schema docs advocating for two-element enums over booleans, and links it from the docs home page to improve discoverability.

Changes:

Added a new explanation doc: “Prefer Binary Enums Over Booleans”.
Updated docs/index.md to include a new “Explanations” section linking to the new doc.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
docs/index.md	Adds an “Explanations” section and links to the new guide.
docs/explanations/binary-enums-vs-booleans.md	New documentation page explaining the rationale and providing LinkML-oriented examples.

Copilot · 2026-03-08T15:02:36Z

docs/explanations/binary-enums-vs-booleans.md

+### Binary Enum Pattern
+
+```yaml
+enums:
+  SortDirection:
+    permissible_values:
+      ASCENDING:
+        description: Sort from lowest to highest
+        meaning: SIO:001395  # ascending order
+      DESCENDING:
+        description: Sort from highest to lowest
+        meaning: SIO:001396  # descending order
+
+  StrandOrientation:
+    permissible_values:
+      FORWARD:
+        description: Forward/plus strand
+        meaning: SO:0000853  # forward_strand
+      REVERSE:
+        description: Reverse/minus strand
+        meaning: SO:0000854  # reverse_strand
+
+  PresenceStatus:
+    permissible_values:
+      PRESENT:
+        description: The entity is present
+      ABSENT:
+        description: The entity is absent
+      NOT_DETERMINED:
+        description: Presence could not be determined
+```


This section is titled “Binary Enum Pattern”, but the PresenceStatus example includes three permissible values (adds NOT_DETERMINED). That’s a good illustration of extensibility, but it’s no longer a binary enum and may confuse readers. Consider moving PresenceStatus into its own “Extending a binary enum” example or renaming the section to reflect that it includes non-binary cases.

Copilot · 2026-03-08T15:02:36Z

docs/explanations/binary-enums-vs-booleans.md

+sample:
+  is_control: false
+
+# Self-explanatory
+sample:


The YAML snippet under “Self-Documenting Code” contains two sample: top-level keys in the same YAML document, which makes the example invalid (the second key overwrites the first). Consider splitting this into two separate code blocks or using distinct keys (e.g., sample1/sample2) so readers can copy/paste valid YAML.

Suggested change

sample:

is_control: false

# Self-explanatory

sample:

sample1:

is_control: false

# Self-explanatory

sample2:

Copilot AI review requested due to automatic review settings March 8, 2026 14:59

Copilot started reviewing on behalf of cmungall March 8, 2026 15:00 View session

Copilot AI reviewed Mar 8, 2026

View reviewed changes

dragon-ai-agent merged commit 6038e9b into main Mar 8, 2026
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documentation on preferring binary enums over booleans#59

Add documentation on preferring binary enums over booleans#59
dragon-ai-agent merged 1 commit intomainfrom
claude/binary-enums-documentation-V0piC

cmungall commented Mar 8, 2026

Uh oh!

claude bot commented Mar 8, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 8, 2026

Uh oh!

Copilot AI Mar 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

cmungall commented Mar 8, 2026

Summary

Key Changes

Notable Details

Uh oh!

claude bot commented Mar 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review Analysis

Todo List:

✅ Strengths

🔍 Areas for Enhancement

🚀 Suggestions

🔒 Security & Performance

📋 Index.md Changes

🎯 Overall Assessment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

claude bot commented Mar 8, 2026 •

edited

Loading