This design document covers the V0 implementation of persistent volumes that can be mounted in sandboxes and accessed via SDK/API:

- NFS v3 proxy server using willscott/go-nfs
- GCS backend for file storage
- Database schema for filesystem metadata
- API endpoints for filesystem CRUD and file operations
- SDK interface for TypeScript/JavaScript and Python
- Volume mount configuration for sandbox creation
- Security and observability considerations

Based on discussion in #proj-filesystem-persistence Slack channel.

Co-authored-by: vasek <vasek@e2b.dev>
```sql
team_id UUID NOT NULL REFERENCES teams(id) ON DELETE CASCADE,
name VARCHAR(255) NOT NULL,
size_bytes BIGINT NOT NULL, -- Quota/limit for the volume
created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
```
The SQL schema defines size_bytes BIGINT NOT NULL but the API spec uses sizeGB (integer gigabytes). This creates a conversion point that could lead to precision issues or overflow. Consider using bytes consistently throughout the API and SDK, or clearly document the conversion behavior (e.g., does 1.5GB get truncated to 1GB?).
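One way to keep that boundary explicit is to validate and convert exactly once at the API layer and store only bytes everywhere below it. A minimal sketch (function and constant names are hypothetical, not from the design doc):

```go
package main

import (
	"errors"
	"fmt"
)

// maxSizeGB mirrors the API spec's `maximum: 100`.
const maxSizeGB = 100

// gbToBytes converts the API's integer sizeGB into the size_bytes
// column value. Inputs are validated against the spec's bounds first,
// so the shift can never overflow int64 (100 GiB is far below 2^63-1).
func gbToBytes(sizeGB int64) (int64, error) {
	if sizeGB < 1 || sizeGB > maxSizeGB {
		return 0, errors.New("sizeGB out of range [1, 100]")
	}
	return sizeGB << 30, nil // 1 GiB = 2^30 bytes
}

func main() {
	b, err := gbToBytes(5)
	fmt.Println(b, err) // 5368709120 <nil>
}
```

Because the API type is integer gigabytes, fractional sizes like 1.5GB are rejected at parse time rather than silently truncated, which sidesteps the precision question entirely.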
```sql
created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),

UNIQUE(team_id, name)
```
The schema is missing an `updated_at` trigger. Based on existing patterns in the codebase (e.g., migrations/20231124185944_create_schemas_and_tables.sql), tables typically need a trigger to auto-update the `updated_at` field on every UPDATE. Without it, the timestamp never changes after creation.
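A sketch of such a trigger, assuming the table is named `filesystems`; the function name here is hypothetical, and the migration should reuse whatever trigger function the existing migrations already define:

```sql
-- Hypothetical names; mirror the existing migration pattern.
CREATE OR REPLACE FUNCTION set_updated_at() RETURNS TRIGGER AS $$
BEGIN
  NEW.updated_at = NOW();
  RETURN NEW;
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER filesystems_set_updated_at
  BEFORE UPDATE ON filesystems
  FOR EACH ROW
  EXECUTE FUNCTION set_updated_at();
```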
```sql
UNIQUE(team_id, name)
);

-- Index for efficient team lookups
```
Missing RLS (Row Level Security) policy. Based on existing patterns in the codebase, tables should have ENABLE ROW LEVEL SECURITY and appropriate policies to prevent cross-team data access. Without this, the security model relies solely on application-level checks, which is less secure than database-level enforcement.
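A sketch of the database-level enforcement, assuming the table is named `filesystems`; the policy predicate below is a placeholder, and the real policy should mirror whatever mechanism the existing team-scoped tables use to identify the caller's team:

```sql
-- Sketch only; predicate is a placeholder, not the codebase's pattern.
ALTER TABLE filesystems ENABLE ROW LEVEL SECURITY;

CREATE POLICY filesystems_team_isolation ON filesystems
  USING (team_id = current_setting('app.current_team_id')::uuid);
```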
```yaml
type: integer
minimum: 1
maximum: 100
description: Storage quota in gigabytes
```
The API spec allows quotas from 1GB (`minimum: 1`) up to 100GB (`maximum: 100`). Consider whether 100GB is appropriate for a PoC - this could lead to high storage costs if many users create max-size volumes.
```yaml
tags: [filesystems]
security:
  - ApiKeyAuth: []
parameters:
```
The path parameter in file operations endpoints uses wildcard matching but lacks validation constraints. File paths need sanitization to prevent directory traversal attacks (e.g., ../../etc/passwd). The design should specify: (1) path validation rules, (2) whether absolute vs relative paths are allowed, and (3) how to handle symbolic links.
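One common approach for (1) and (2) is to treat every client path as relative to the volume root and lexically collapse `..`/`.` segments before use, so traversal cannot escape the root; symlinks (3) still need a separate check at resolution time. A sketch (function name hypothetical):

```go
package main

import (
	"errors"
	"fmt"
	"path"
	"strings"
)

// sanitizePath resolves a client-supplied file path against the volume
// root. Rooting the input with "/" before path.Clean means ".." segments
// collapse against the root instead of escaping it, so the result is
// always confined to the volume.
func sanitizePath(p string) (string, error) {
	if strings.Contains(p, "\x00") {
		return "", errors.New("invalid NUL byte in path")
	}
	clean := path.Clean("/" + p)
	if clean == "/" {
		return "", errors.New("path resolves to volume root")
	}
	return clean, nil
}

func main() {
	fmt.Println(sanitizePath("foo/bar.txt"))      // /foo/bar.txt <nil>
	fmt.Println(sanitizePath("../../etc/passwd")) // /etc/passwd <nil> (confined under the volume root)
}
```

Note the traversal attempt is neutralized rather than rejected: `../../etc/passwd` maps to `<volume-root>/etc/passwd`, never to the host's `/etc/passwd`. Rejecting any input containing `..` outright is an equally valid, stricter policy.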
```markdown
### Concurrency Considerations

For V0, we allow concurrent access via NFS (no explicit locking):
```
With no locking above GCS, concurrent writes from multiple sandboxes to the same object simply race: the last write wins, with no ordering guarantee, so data can be silently lost or corrupted. (GCS itself is strongly consistent for object operations, but that does not serialize concurrent writers.) This is a significant data integrity risk for concurrent access scenarios. Consider adding a coordination mechanism, e.g., GCS generation preconditions (`ifGenerationMatch`), or documenting this limitation prominently.
```markdown
2. **Orchestrator** resolves filesystem IDs to NFS server addresses
3. **Firecracker VM** starts with NFS client configured
4. **Init script** mounts NFS shares to specified paths:
```
Using the NFS `nolock` option (line 236) disables file locking entirely. Combined with the absence of any coordination layer over GCS, there is no mechanism to serialize concurrent access. This could lead to: (1) corrupted files from concurrent writes, (2) partial reads of files being written, (3) race conditions in file creation/deletion. The design needs either locking support or very clear documentation of these limitations.
```json
"createdAt": "2026-01-20T12:00:00Z"
}
]
```
The design doesn't specify what happens when a filesystem is deleted while it's still mounted in active sandboxes. This could lead to: (1) sandboxes experiencing I/O errors, (2) stale NFS mounts, (3) orphaned data. Consider adding reference counting or preventing deletion of in-use filesystems.
```typescript
// Create a filesystem
const filesystem = await Filesystem.create('my-volume', '5gb')
```
The SDK allows mounting by filesystem name ('my-volume') in addition to ID, but name resolution isn't specified. Since names are only unique within a team, the orchestrator needs to resolve names to IDs during sandbox creation. This adds latency and a potential point of failure. Consider whether name-based mounting is necessary for v0 or if ID-only would be simpler.
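If name-based mounting stays in v0, the resolution step can be a single team-scoped lookup at sandbox creation. A sketch, assuming IDs carry a distinguishing `fs_` prefix (that prefix and all identifiers here are hypothetical):

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// resolveFilesystemID maps a team-scoped name (or a literal ID) to a
// filesystem ID at sandbox creation time. The byName map stands in for
// a database query scoped to the caller's team.
func resolveFilesystemID(byName map[string]string, ref string) (string, error) {
	// Assumed convention: IDs are prefixed "fs_" and pass through unchanged.
	if strings.HasPrefix(ref, "fs_") {
		return ref, nil
	}
	id, ok := byName[ref]
	if !ok {
		return "", errors.New("unknown filesystem name: " + ref)
	}
	return id, nil
}

func main() {
	names := map[string]string{"my-volume": "fs_abc123"} // one team's names
	fmt.Println(resolveFilesystemID(names, "my-volume")) // fs_abc123 <nil>
	fmt.Println(resolveFilesystemID(names, "fs_def456")) // fs_def456 <nil>
}
```

Because names are unique per team (the `UNIQUE(team_id, name)` constraint above), the lookup is unambiguous, but it does add one query and one failure mode to the sandbox-creation path, which is the trade-off the comment raises.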
```markdown
### NFS Proxy → GCS Mapping

| NFS Operation | GCS Operation |
|---------------|---------------|
```
NFS WRITE operations are mapped to GCS objects.insert, but GCS objects are immutable: every WRITE becomes a full-object rewrite (a new generation, and a retained noncurrent version if bucket versioning is enabled). For frequently modified files this will: (1) rewrite the entire object for each small write, (2) increase request and storage costs significantly, (3) require lifecycle policies for cleanup when versioning is on. Note that objects.patch only updates object metadata, not content, so it cannot help here; consider a write-back cache that batches WRITEs and flushes once on COMMIT/close instead.
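The write-back idea can be sketched in a few lines: buffer WRITEs in memory and upload the whole object once on COMMIT. All names below are hypothetical, and `flush` stands in for the real GCS upload:

```go
package main

import (
	"fmt"
	"sync"
)

// writeBackFile buffers NFS WRITEs in memory and uploads the object
// once, on COMMIT/close, instead of one GCS rewrite per WRITE.
type writeBackFile struct {
	mu    sync.Mutex
	buf   []byte
	dirty bool
	flush func(data []byte) error // stand-in for the GCS upload
}

// WriteAt applies a WRITE into the in-memory buffer, growing it as needed.
func (f *writeBackFile) WriteAt(p []byte, off int64) {
	f.mu.Lock()
	defer f.mu.Unlock()
	if need := int(off) + len(p); need > len(f.buf) {
		f.buf = append(f.buf, make([]byte, need-len(f.buf))...)
	}
	copy(f.buf[off:], p)
	f.dirty = true
}

// Commit uploads the buffered contents once, and only if anything changed.
func (f *writeBackFile) Commit() error {
	f.mu.Lock()
	defer f.mu.Unlock()
	if !f.dirty {
		return nil
	}
	if err := f.flush(f.buf); err != nil {
		return err
	}
	f.dirty = false
	return nil
}

func main() {
	uploads := 0
	f := &writeBackFile{flush: func([]byte) error { uploads++; return nil }}
	f.WriteAt([]byte("hello"), 0)
	f.WriteAt([]byte("world"), 5)
	f.Commit()
	fmt.Println(uploads, string(f.buf)) // 1 helloworld
}
```

The obvious cost is durability: buffered data is lost if the proxy dies before COMMIT, so this trades crash safety for write amplification, which is worth stating explicitly in the design.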
Add a design document for the persistent filesystem feature to outline its architecture, API, and implementation plan.