Troubleshooting
Common issues and fixes for Hubify Labs — CLI, agents, pods, papers, and deployments.
Solutions for common issues across the CLI, web UI, agents, compute, and paper pipeline.
CLI Issues
<AccordionGroup>hubify: command not found
The CLI is not in your PATH. Fix depends on your install method:
# npm: add global bin to PATH
export PATH="$(npm config get prefix)/bin:$PATH"
# Add to your shell profile (~/.zshrc or ~/.bashrc)
echo 'export PATH="$(npm config get prefix)/bin:$PATH"' >> ~/.zshrc
source ~/.zshrc
Authentication expired
Tokens expire periodically. Re-authenticate:
hubify auth login
For CI environments, create a long-lived token:
hubify auth token --create --name "ci" --ttl 90d
Wrong lab selected
Commands run against the active lab. Switch labs:
# Check current lab
hubify lab info
# Switch
hubify lab use correct-lab-slug
# Or override per-command
hubify experiment list --lab correct-lab-slug
Network timeout errors
The CLI communicates with api.hubify.com over HTTPS. Check your network:
curl -I https://api.hubify.com/v1/health
If behind a proxy:
export HTTPS_PROXY=https://proxy.example.com:8080
</AccordionGroup>
Agent Issues
<AccordionGroup>Agent stuck or unresponsive
Check agent status:
hubify agent list
If an agent shows as stuck, restart it:
hubify agent update "Research Lead" --status active
Cross-model review failing
Verify external API keys are configured:
hubify agent review-config --show
If a provider's key is expired or invalid, update it:
hubify agent review-config --add-reviewer gpt-5.4 --api-key $NEW_OPENAI_KEY
Orchestrator not routing tasks
Ensure auto-scheduling is enabled:
hubify agent update orchestrator --auto-schedule true
Check if the orchestrator is overloaded:
hubify agent metrics orchestrator
</AccordionGroup>
Compute Issues
<AccordionGroup>Pod stuck in provisioning
The requested GPU may be out of stock. Check availability:
hubify pod list --available
Try a different GPU type or region:
hubify experiment rerun EXP-051 --override "pod=a100"
Out of memory (OOM) on GPU
Reduce batch size or upgrade GPU:
- H100 (80 GB) handles most workloads
- H200 (141 GB) for the largest datasets
For PyTorch, enable gradient checkpointing to reduce memory:
model.gradient_checkpointing_enable()
Idle pod wasting money
Find and terminate idle pods:
hubify pod list --json | jq '.[] | select(.status=="idle")'
hubify pod stop pod-idle-123
Prevent future waste by setting an idle timeout:
hubify pod config --idle-timeout 15m
Budget limit reached
When the monthly budget is hit, new experiments queue instead of launching:
# Check budget
hubify pod budget --show
# Increase limit
hubify pod budget --monthly 750
# Or wait until next month
</AccordionGroup>
Paper Issues
<AccordionGroup>Figures appear as empty boxes in PDF
Figures must be in the same directory as the .tex file. The compile command handles this, but verify:
hubify paper compile paper-1 --check
If the PDF is under 1 MB, figures are not embedded.
Undefined references in PDF
Run compilation multiple times (the compile command does this automatically):
hubify paper compile paper-1
If references persist, check that all \cite{} keys exist in your .bib file.
LaTeX compilation errors
Common issues:
- Using
\citep/\citet— use\cite{}with revtex4-2 - Using
\deluxetable— use\begin{table}\begin{ruledtabular}\begin{tabular} - Using
aastex631class — userevtex4-2
View the full compile log:
hubify paper compile paper-1 --verbose
</AccordionGroup>
Lab Site Issues
<AccordionGroup>Site not updating after experiment
Force a sync and redeploy:
hubify site sync
hubify site deploy
Custom domain not working
Verify DNS and SSL:
hubify site domain --status
Ensure your CNAME record points to cname.hubify.app. SSL provisioning takes 1-5 minutes.
Getting Help
If your issue is not listed here:
# Check system status
hubify status --system
# View debug logs
hubify logs --system --level debug --since "1h ago"
# Contact support
hubify support "Description of your issue"