Luma Ray flash 2 (720p) Serverless API

Generate stunning 720p videos from text with the Luma ray-flash-2-720p model. Faster & cheaper than Ray 2, offering realistic motion & detail.

~59.93s
POST /v2/ray-flash-2-720p · submit + poll
 1# pip install "segmind>=1.1.0"
 2# export SEGMIND_API_KEY="YOUR_API_KEY"
 3from segmind import SegmindClient, InferenceFailed, InferenceTimeout
 4
 5# Async (v2) — recommended for long-running / video models.
 6# run() blocks up to 600s; submit_async + job.wait(timeout=...) sets a longer
 7# deadline and keeps the request_id so you can re-poll later.
 8client = SegmindClient()                      # reads SEGMIND_API_KEY
 9payload = {
10    "loop": False,
11    "prompt": "a young boy riding a red bicycle in grand canyon valleys",
12    "duration": "5",
13    "aspect_ratio": "16:9",
14}
15job = client.submit_async("ray-flash-2-720p", **payload)
16print(job.request_id)                         # available immediately
17try:
18    result = job.wait(timeout=900, interval=2.0)
19    print(result["status"])                  # COMPLETED
20    print(result.get("output"))              # model output (e.g. video URL)
21except InferenceTimeout as e:
22    print("still running:", e.request_id)    # re-poll later with this id
23except InferenceFailed as e:
24    print("failed:", e.detail)
25
26# Fast models (<=600s) can use the one-liner instead:
27# result = segmind.run("ray-flash-2-720p", **payload)

API Endpoint

POSThttps://api.segmind.com/v1/ray-flash-2-720p

Parameters

promptrequired
string

Text prompt for video generation

Default: "a young boy riding a red bicycle in grand canyon valleys"
aspect_ratiooptional
string

Aspect ratio of the video.

Default: "16:9"
Allowed values :
"1:1""3:4""4:3""9:16""16:9""9:21""21:9"
durationoptional
integer

Duration of the output.

Default: 5
Allowed values :
59
end_image_urloptional
string (uri)

URL of an image to use as the ending frame

Default: null
loopoptional
boolean

Whether the video should loop, with the last frame matching the first frame for smooth, continuous playback.

Default: false
start_image_urloptional
string (uri)

URL of an image to use as the starting frame

Default: null

Response Type

Returns: Video

Asynchronous requests (v2)

Use Async for video, long-running (>~60s), or high-concurrency workloads; Sync is simplest for fast image & LLM calls. Async submits a request and you poll it to completion.

  1. 1
    POST /v2/ray-flash-2-720p

    Submitreturns request_id, status_url, response_url

  2. 2
    GET /v2/requests/{id}/status

    Polluntil COMPLETED or FAILED

  3. 3
    GET /v2/requests/{id}

    Resultfinal response body

Status states

QUEUEDAccepted, waiting for a worker
PROCESSINGRunning on a worker
COMPLETEDDone — result body is ready
FAILEDErrored (incl. content/RAI blocks)
  • A FAILED request is served as HTTP 422 — the body still carries the error detail.
  • An unknown or expired request_id returns HTTP 404.
  • Results are retained for 1 hour, then expire.
  • Content / RAI blocks surface as FAILED, not a separate state.
  • Track completion by polling the status endpoint.

Common Error Codes

The API returns standard HTTP status codes. Detailed error messages are provided in the response body.

400

Bad Request

Invalid parameters or request format

401

Unauthorized

Missing or invalid API key

403

Forbidden

Insufficient permissions

404

Not Found

Model or endpoint not found

406

Insufficient Credits

Not enough credits to process request

429

Rate Limited

Too many requests

500

Server Error

Internal server error

502

Bad Gateway

Service temporarily unavailable

504

Timeout

Request timed out