A different cost floor, not a discount
On fal, your cost floor is set by H100 / H200 / A100 / B200 cloud rental rates × runtime — a function of centralized hardware economics. deAPI sources GPU-seconds from a global pool of independent providers competing for inference work, which reports inference cost reductions of up to 20× versus traditional cloud APIs.