Skip to main content

How to run an embedded vector database in 10 lines of code

· 3 min read
Dan Dascalescu

Yes, 10 Python lines of code, generously formatted with whitespace. Or 14 for TypeScript. Oh, and all your data stays private locally, and we don't charge you anything. We're also going to build a useful example, illustrating a testing scenario. Here's how.

1. Install the client library

The Python and TypeScript client libraries support running Weaviate embedded on Linux, and starting with versions 3.21.0 and 1.2.0 respectively, on macOS as well.

pip install weaviate-client  --upgrade

2. Run the code

Save as embedded.py and run python embedded.py:


import weaviate
import json

client = weaviate.Client(
embedded_options=weaviate.embedded.EmbeddedOptions(),
)
# END TestExample

uuid = client.data_object.create({
'hello': 'World!'
}, 'MyClass')

obj = client.data_object.get_by_id(uuid, class_name='MyClass')

print(json.dumps(obj, indent=2))

How does this work?

Essentially, what happens behind the scenes is that the client library downloads the server binary, spawns it in a separate process, connects to it, then terminates it on exit. The data is persisted, so you can use it from future invocations, or you can transfer it to another instance.

You can learn more about running Weaviate locally from client code on the Embedded Weaviate page.

Use cases

What can you do with Embedded Weaviate? Quite a few things!

First off, you can get started very quickly with Weaviate on your local machine, without having to explicitly download, install or instantiate a server.

Jupyter notebooks

You can also use Embedded Weaviate from Jupyter notebooks, including on Google Colaboratory. Here is the example above as Google Colab notebook.

Colab screenshot

Use Weaviate in CI/CD pipelines

You can use Embedded Weaviate in automated tests, where you can run integration tests without having to manage a separate server instance. Here is the example above slightly modified to perform similarity search and test that the added object was found.

Save as embedded_test.py and run pytest. (If you don't have pytest, run pip install pytest.)


import weaviate
import json

client = weaviate.Client(
embedded_options=weaviate.embedded.EmbeddedOptions(),
additional_headers={
'X-OpenAI-Api-Key': 'YOUR-OPENAI-API-KEY' # Replace with your OPENAI API key
}
)

client.schema.create_class({
'class': 'Wine',
'vectorizer': 'text2vec-openai',
})

client.data_object.create({
'name': 'Chardonnay',
'review': 'Goes well with fish!',
}, 'Wine')

response = (
client.query
.get('Wine', ['name', 'review'])
.with_near_text({
'concepts': ['great for seafood']
})
.do()
)

assert response['data']['Get']['Wine'][0]['review'] == 'Goes well with fish!'

Have you found other use cases for embedded Weaviate? Let us know in the comments below!

Ready to start building?

Check out the Quickstart tutorial, and begin building amazing apps with the free trial of Weaviate Cloud Services (WCS).

Don't want to miss another blog post?

Sign up for our bi-weekly newsletter to stay updated!
By submitting, I agree to the Terms of Service and Privacy Policy.