LLM finetuning & inference
Llama, Mistral, Qwen, DeepSeek finetuning with LoRA / QLoRA / full FT on H100. Or self-hosted inference with vLLM / TGI / Ollama for production model serving.
Offshore NVIDIA GPU servers in Moldova from $249/mo. RTX 4090, RTX 5090 and H100 SXM5 cards passed through KVM with full root. CUDA 12 + cuDNN preinstalled, PyTorch / ComfyUI / Ollama image presets ready to ssh into. Crypto-only checkout, no KYC, no email — just an account token.
Moldova is the budget GPU tier in our network. Same NVIDIA hardware, but lower electricity cost and minimal regulatory framework let us price GPU plans 10-15% below Romania and 15-25% below Iceland. Use this jurisdiction when cost-per-token matters more than peering or marketing posture.
All plans include CUDA 12 + cuDNN preinstalled, NVMe SSD, DDR5 RAM, full root access, SSH + JupyterLab and unlimited bandwidth.
| Plan | GPU | VRAM | CPU | RAM | NVMe | Bandwidth | Price | |
|---|---|---|---|---|---|---|---|---|
| MD-S | 1× NVIDIA RTX 4090 | 24 GB GDDR6X | 12 vCPU | 64 GB DDR5 | 1 TB NVMe | Unlimited | $249/mo | Order |
| MD-M Popular | 1× NVIDIA RTX 5090 | 32 GB GDDR7 | 16 vCPU | 96 GB DDR5 | 1.5 TB NVMe | Unlimited | $399/mo | Order |
| MD-L | 1× NVIDIA H100 SXM5 | 80 GB HBM3 | 24 vCPU | 192 GB DDR5 | 2 TB NVMe | Unlimited | $1699/mo | Order |
| MD-XL | 2× NVIDIA H100 SXM5 | 160 GB HBM3 | 32 vCPU | 384 GB DDR5 | 4 TB NVMe | Unlimited | $3199/mo | Order |
GPU servers shine on workloads that scale with VRAM and tensor cores — LLM finetuning and inference, diffusion image generation, AI video, and high-throughput model serving.
Llama, Mistral, Qwen, DeepSeek finetuning with LoRA / QLoRA / full FT on H100. Or self-hosted inference with vLLM / TGI / Ollama for production model serving.
Stable Diffusion, FLUX.1, SDXL with ComfyUI or Forge. Train your own LoRA, batch-generate at scale, or self-host an inference endpoint.
OpenSora, CogVideoX, Wan-2.1, AnimateDiff. Video generation needs serious VRAM — start at RTX 5090 (32 GB) or H100 (80 GB).
Deploy fine-tuned models behind your own API. Predictable costs, no per-token fees, no data leaving your jurisdiction. JupyterLab + FastAPI included.
RTX 4090 (24 GB), RTX 5090 (32 GB), H100 SXM5 (80 GB), 2× H100 (160 GB).
Up to 4 TB NVMe SSD, paired with DDR5 RAM for fast dataset I/O.
From paid order to nvidia-smi output in under 60 seconds.
Full root SSH, plus pre-bound JupyterLab on port 8888 with token auth.
Yes. While less well-known than Iceland or Switzerland, Moldova offers solid infrastructure with European peering. Its light regulatory environment and low costs make it a strong value proposition for offshore hosting.
Our Moldova VPS plans start at $14.99/mo for 2 vCPU, 4GB DDR4 RAM, 60GB NVMe, and unlimited bandwidth. This is the most affordable option in our network.
Moldova has very limited judicial cooperation with western countries. There are no binding data-sharing agreements with the US or most EU members that would affect hosting providers.
Pay in BTC, XMR, ETH, USDT or 10 other chains. SSH + JupyterLab on a real NVIDIA GPU in Moldova in under 60 seconds.