https://github.com/ray-project/ray/raw/master/doc/source/images/ray_header_logo.png

https://readthedocs.org/projects/ray/badge/?version=master https://img.shields.io/badge/Ray-Join%20Slack-blue https://img.shields.io/badge/Discuss-Ask%20Questions-blue https://img.shields.io/twitter/follow/raydistributed.svg?style=social&logo=twitter

Ray provides a simple, universal API for building distributed applications.

Ray is packaged with the following libraries for accelerating machine learning workloads:

  • Tune: Scalable Hyperparameter Tuning
  • RLlib: Scalable Reinforcement Learning
  • Train: Distributed Deep Learning (beta)
  • Datasets: Distributed Data Loading and Compute (beta)

As well as libraries for taking ML and distributed apps to production:

  • Serve: Scalable and Programmable Serving
  • Workflows: Fast, Durable Application Flows (alpha)

There are also many community integrations with Ray, including Dask, MARS, Modin, Horovod, Hugging Face, Scikit-learn, and others. Check out the full list of Ray distributed libraries here.

Install Ray with: pip install ray. For nightly wheels, see the Installation page.

Quick Start

Execute Python functions in parallel.

import ray
ray.init()

@ray.remote
def f(x):
    return x * x

futures = [f.remote(i) for i in range(4)]
print(ray.get(futures))
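
Object refs returned by .remote() can also be passed directly into other tasks without fetching their values first; Ray resolves them before the downstream task runs. A minimal sketch building on the f defined above (the add function is illustrative):

@ray.remote
def add(a, b):
    return a + b

# Ray fetches the values of the two object refs before running add.
result_ref = add.remote(f.remote(2), f.remote(3))
print(ray.get(result_ref))  # 4 + 9 = 13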

To use Ray's actor model:

import ray
ray.init()

@ray.remote
class Counter(object):
    def __init__(self):
        self.n = 0

    def increment(self):
        self.n += 1

    def read(self):
        return self.n

counters = [Counter.remote() for i in range(4)]
[c.increment.remote() for c in counters]
futures = [c.read.remote() for c in counters]
print(ray.get(futures))

Ray programs can run on a single machine, and can also seamlessly scale to large clusters. To execute the above Ray script in the cloud, just download this configuration file, and run:

ray submit [CLUSTER.YAML] example.py --start

Read more about launching clusters.
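
When your script runs on a cluster node rather than on your laptop, the usual change is how ray.init is called. A minimal sketch, assuming a Ray cluster is already running on the machine:

import ray

# Connect to the existing Ray cluster instead of starting a new local one.
ray.init(address="auto")

# Print the resources (CPUs, GPUs, memory) aggregated across all nodes.
print(ray.cluster_resources())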

Tune Quick Start

https://github.com/ray-project/ray/raw/master/doc/source/images/tune-wide.png

Tune is a library for hyperparameter tuning at any scale.

To run this example, you will need to install the following:

$ pip install "ray[tune]"

This example runs a parallel grid search to optimize an example objective function.

from ray import tune


def objective(step, alpha, beta):
    return (0.1 + alpha * step / 100)**(-1) + beta * 0.1


def training_function(config):
    # Hyperparameters
    alpha, beta = config["alpha"], config["beta"]
    for step in range(10):
        # Iterative training function - can be any arbitrary training procedure.
        intermediate_score = objective(step, alpha, beta)
        # Feed the score back to Tune.
        tune.report(mean_loss=intermediate_score)


analysis = tune.run(
    training_function,
    config={
        "alpha": tune.grid_search([0.001, 0.01, 0.1]),
        "beta": tune.choice([1, 2, 3])
    })

print("Best config: ", analysis.get_best_config(metric="mean_loss", mode="min"))

# Get a dataframe for analyzing trial results.
df = analysis.results_df
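
The same search can be scaled out without changing the training code; num_samples and resources_per_trial are standard tune.run arguments (the values below are illustrative):

analysis = tune.run(
    training_function,
    config={
        "alpha": tune.grid_search([0.001, 0.01, 0.1]),
        "beta": tune.choice([1, 2, 3])
    },
    # Repeat the grid search 5 times, re-sampling "beta" for each repeat.
    num_samples=5,
    # Reserve one CPU and no GPU per trial; trials run in parallel up to
    # the cluster's capacity.
    resources_per_trial={"cpu": 1, "gpu": 0})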

If TensorBoard is installed, automatically visualize all trial results:

tensorboard --logdir ~/ray_results

RLlib Quick Start

https://github.com/ray-project/ray/raw/master/doc/source/images/rllib/rllib-logo.png

RLlib is an industry-grade library for reinforcement learning (RL), built on top of Ray. It offers high scalability and unified APIs for a variety of industry and research applications.

$ pip install "ray[rllib]" tensorflow  # or torch
import gym
from ray.rllib.agents.ppo import PPOTrainer


# Define your problem using Python and OpenAI's gym API:
class SimpleCorridor(gym.Env):
    """Corridor in which an agent must learn to move right to reach the exit.

    ---------------------
    | S | 1 | 2 | 3 | G |   S=start; G=goal; corridor_length=5
    ---------------------

    Possible actions to choose from are: 0=left; 1=right.
    Observations are floats indicating the current field index, e.g. 0.0 for
    the starting position, 1.0 for the field next to the starting position, etc.
    Rewards are -0.1 for all steps, except when reaching the goal (+1.0).
    """

    def __init__(self, config):
        self.end_pos = config["corridor_length"] - 1  # Index of the goal field G.
        self.cur_pos = 0
        self.action_space = gym.spaces.Discrete(2)  # left and right
        self.observation_space = gym.spaces.Box(0.0, self.end_pos, shape=(1,))

    def reset(self):
        """Resets the episode and returns the initial observation of the new one.
        """
        self.cur_pos = 0
        # Return initial observation.
        return [self.cur_pos]

    def step(self, action):
        """Takes a single step in the episode given `action`

        Returns:
            New observation, reward, done-flag, info-dict (empty).
        """
        # Walk left.
        if action == 0 and self.cur_pos > 0:
            self.cur_pos -= 1
        # Walk right.
        elif action == 1:
            self.cur_pos += 1
        # Set `done` flag when end of corridor (goal) reached.
        done = self.cur_pos >= self.end_pos
        # +1.0 when the goal is reached, otherwise -0.1.
        reward = 1.0 if done else -0.1
        return [self.cur_pos], reward, done, {}


# Create an RLlib Trainer instance.
trainer = PPOTrainer(
    config={
        # Env class to use (here: our gym.Env sub-class from above).
        "env": SimpleCorridor,
        # Config dict to be passed to our custom env's constructor.
        "env_config": {
            # Use corridor with 20 fields (including S and G).
            "corridor_length": 20
        },
        # Parallelize environment rollouts.
        "num_workers": 3,
    })

# Train for n iterations and report results (mean episode rewards).
# Since we have to move at least 19 times in the env to reach the goal and
# each move gives us -0.1 reward (except the last move at the end: +1.0),
# we can expect to reach an optimal episode reward of -0.1*18 + 1.0 = -0.8
for i in range(5):
    results = trainer.train()
    print(f"Iter: {i}; avg. reward={results['episode_reward_mean']}")

After training, you may want to perform action computations (inference) in your environment. Here is a minimal example of how to do this. Also check out our more detailed examples here (in particular for normal models, LSTMs, and attention nets).

# Perform inference (action computations) based on given env observations.
# Note that we are using a slightly different env here (len 10 instead of 20);
# however, this should still work, as the agent has (hopefully) learned
# to "just always walk right!"
env = SimpleCorridor({"corridor_length": 10})
# Get the initial observation (should be: [0.0] for the starting position).
obs = env.reset()
done = False
total_reward = 0.0
# Play one episode.
while not done:
    # Compute a single action, given the current observation
    # from the environment.
    action = trainer.compute_single_action(obs)
    # Apply the computed action in the environment.
    obs, reward, done, info = env.step(action)
    # Sum up rewards for reporting purposes.
    total_reward += reward
# Report results.
print(f"Played 1 episode; total-reward={total_reward}")

Ray Serve Quick Start

Ray Serve is a scalable model-serving library built on Ray. It is:

  • Framework Agnostic: Use the same toolkit to serve everything from deep learning models built with frameworks like PyTorch or TensorFlow & Keras to scikit-learn models or arbitrary business logic.
  • Python First: Configure your model serving declaratively in pure Python, without needing YAML or JSON configs.
  • Performance Oriented: Turn on batching, pipelining, and GPU acceleration to increase the throughput of your model (see the batching sketch at the end of this section).
  • Composition Native: Allows you to create "model pipelines" by composing multiple models together to drive a single prediction.
  • Horizontally Scalable: Serve can linearly scale as you add more machines. Enable your ML-powered service to handle growing traffic.

To run this example, you will need to install the following:

$ pip install scikit-learn
$ pip install "ray[serve]"

This example serves a scikit-learn gradient boosting classifier.

import pickle
import requests

from sklearn.datasets import load_iris
from sklearn.ensemble import GradientBoostingClassifier

from ray import serve

serve.start()

# Train model.
iris_dataset = load_iris()
model = GradientBoostingClassifier()
model.fit(iris_dataset["data"], iris_dataset["target"])

@serve.deployment(route_prefix="/iris")
class BoostingModel:
    def __init__(self, model):
        self.model = model
        self.label_list = iris_dataset["target_names"].tolist()

    async def __call__(self, request):
        payload = (await request.json())["vector"]
        print(f"Received http request with data {payload}")

        prediction = self.model.predict([payload])[0]
        human_name = self.label_list[prediction]
        return {"result": human_name}


# Deploy model.
BoostingModel.deploy(model)

# Query it!
sample_request_input = {"vector": [1.2, 1.0, 1.1, 0.9]}
response = requests.get("http://localhost:8000/iris", json=sample_request_input)
print(response.text)
# Result:
# {
#  "result": "versicolor"
# }
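
The batching feature mentioned above can be layered onto the same model. A minimal sketch, assuming the serve.batch decorator and reusing the model and iris_dataset objects from the example; the deployment and method names are illustrative:

from typing import List

@serve.deployment(route_prefix="/iris_batched")
class BatchedBoostingModel:
    def __init__(self, model):
        self.model = model
        self.label_list = iris_dataset["target_names"].tolist()

    # Requests arriving close together are grouped into one model call.
    @serve.batch(max_batch_size=8)
    async def predict_batch(self, vectors: List[list]):
        predictions = self.model.predict(vectors)
        return [{"result": self.label_list[p]} for p in predictions]

    async def __call__(self, request):
        payload = (await request.json())["vector"]
        # Each caller passes a single vector; serve.batch assembles the batch.
        return await self.predict_batch(payload)

BatchedBoostingModel.deploy(model)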

Getting Involved

  • Forum: For discussions about development, questions about usage, and feature requests.
  • GitHub Issues: For reporting bugs.
  • Twitter: Follow updates on Twitter.
  • Slack: Join our Slack channel.
  • Meetup Group: Join our meetup group.
  • StackOverflow: For questions about how to use Ray.
