RudderStack's Python SDK lets you track and send the events from your Python applications to the specified destinations.

Refer to the SDK's GitHub codebase for the implementation-specific details.

Github Badge

SDK setup requirements

  1. Sign up to RudderStack Cloud.
  2. Set up a Python source in your dashboard. You should be able to see a write key for this source.

You will also need a data plane URL. Refer to the Dashboard Overview guide for more information on the data plane URL and where to find it.

The Setup tab in the RudderStack dashboard (seen above) has the SDK installation snippet containing both the write key and the data plane URL. Copy it to integrate the Python SDK into your application.

Installing the Python SDK

To install the RudderStack Python SDK using pip, run the following command:

pip install rudder-sdk-python

Initializing the RudderStack client

To initialize the SDK, run the following code snippet:

import rudderstack.analytics as rudder_analytics
rudder_analytics.write_key = WRITE_KEY
rudder_analytics.dataPlaneUrl = DATA_PLANE_URL

Initialization options

ParameterDescriptionDefault value
on_errorCallback for exception thrown while uploading the messages.None
debugThe SDK prints the logs if set to True.False
sendThe SDK does not send the data to the RudderStack backend if set to False.True
sync_modeThe SDK sends the data immediately instead of queueing it, if set to True.False
max_queue_sizeMaximum queue size the SDK uses to enqueue the events.10000
gzipThe SDK disables gzipping the event data if set to False.True
timeoutThe timeout for sending POST requests to the RudderStack backend.15
max_retriesMaximum number of retry requests the SDK makes to the RudderStack backend.10
upload_intervalMaximum duration between two upload (flush) activities.0.5s
upload_sizeNumber of events in the queue that triggers a flush.100

Sending events

RudderStack does not store or persist the user state in any of the server-side SDKs.

Unlike the client-side SDKs that deal with only a single user at a given time, the server-side SDKs deal with multiple users simultaneously. Therefore, you must specify either the user_id or anonymous_id every time while making any API calls supported by the Python SDK.

Identify

The identify call lets you identify a visiting user and associate them to their actions. It also lets you record the traits about them like their name, email address, etc.

A sample identify call made using the Python SDK is shown below:

rudder_analytics.identify('1hKOmRA4GRlm', {
'email': 'alex@example.com',
'name': 'John Doe',
'friends': 16
})

The identify method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
anonymous_id
Required, if user_id is absent.
StringThe SDK automatically sets this identifier in cases where there is no unique identifier for the user.
traits
Required
ObjectAn optional dictionary of the user's traits like name or email.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Track

The track call lets you record the user actions along with their associated properties. Each user action is called an event.

A sample track call is shown below:

rudder_analytics.track('123456', 'Article Read', {
'title': 'The Independence',
'subtitle': 'Story of the Weak',
'author': 'John Doe'
})

The track method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
anonymous_id
Required, if user_id is absent.
StringThe SDK automatically sets this identifier in cases where there is no unique identifier for the user.
event
Required
StringName of the event.
properties
Required
ObjectAn optional dictionary of the properties associated with the event.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Page

The page call lets you record the page views on your application along with the other relevant information about the page.

A sample page call is as shown:

rudder_analytics.page('userid', 'Documentation', 'Sample Doc', {
'url': 'http://rudderstack.com'
})

The page method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
anonymous_id
Required, if user_id is absent.
StringThe SDK automatically sets this identifier in cases where there is no unique identifier for the user.
name
Required
StringName of the viewed page.
category
Required
StringCategory of the viewed page.
properties
Required
ObjectAn optional dictionary of the properties associated with the event.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Screen

The screen call is the mobile equivalent of the page call. It lets you record the screen views on your mobile app along with other relevant information about the screen.

A sample screen call is as shown:

rudder_analytics.screen('userid', 'Settings', 'Brightness', {
'from': 'Settings Screen'
})

The screen method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
anonymous_id
Required, if user_id is absent.
StringThe SDK automatically sets this identifier in cases where there is no unique identifier for the user.
name
Required
StringName of the viewed screen.
category
Required
StringCategory of the viewed screen.
properties
Required
ObjectAn optional dictionary of the properties associated with the event.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Group

The group call lets you link an identified user with a group, such as a company, organization, or an account. It also lets you record any custom traits or properties associated with that group.

A sample group call made using the Python SDK is shown below:

rudder_analytics.group('1hKOmRA4GRlm', '12', {
'name': 'Company',
'domain': 'IT'
})

The group method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
anonymous_id
Required, if user_id is absent.
StringThe SDK automatically sets this identifier in cases where there is no unique identifier for the user.
group_id
Required
StringUnique identifier of the group in your database.
traitsObjectAn optional dictionary of the group's traits like name or email.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Alias

The alias call lets you merge different identities of a known user. It is an advanced method that lets you change the tracked user's ID explicitly. You can use alias for managing the user's identity in some of the downstream destinations.

RudderStack supports sending alias events only to select downstream destinations. Refer to the destination-specific documentation for more details.

A sample alias call is as shown:

rudder_analytics.alias('previous_id', 'user_id')

The alias method parameters are as shown:

FieldTypeDescription
user_id
Required, if anonymous_id is absent.
StringUnique identifier for a user in your database.
previous_id
Required
StringThe previous unique identifier of the user.
integrationsObjectAn optional dictionary containing the destinations to be either enabled or disabled.
contextObjectAn optional dictionary of information that provides context about the event. It is not directly related to the API call.
timestampTimestamp in ISO 8601 formatThe timestamp of the event's arrival.
message_idStringA unique identifier for the event. The SDK generates this field automatically. It is highly recommended to not send this field unless there is a specific requirement or use case.

Gzipping requests

The Python SDK automatically gzips requests. However, you can disable this feature by setting the gzip parameter to false while initializing the SDK, as shown:

import rudderstack.analytics as rudder_analytics
rudder_analytics.write_key = WRITE_KEY
rudder_analytics.dataPlaneUrl = DATA_PLANE_URL
rudder_analytics.gzip = False
Gzip requires rudder-server v1.4 or higher. Otherwise, your events might fail.

Flushing events

The Python SDK batches the events and flushes them in the background, for faster and more efficient operation. By default, the SDK flushes a batch of 100 events every 0.5 seconds since the last flush.

You can control the event flushing by tweaking the following parameters:

ParameterDescriptionDefault value
max_queue_sizeMaximum queue size the SDK uses to enqueue the events.10000
upload_intervalMaximum duration between two upload (flush) activities.0.5s

You can also flush the events explicitly by using the SDK's flush() method to make sure no events are left in the queue.

A sample flush call is shown below:

rudder_analytics.flush()
The SDK blocks the calling thread until all the messages are flushed from the queue. Hence, you should avoid using it as a part of your request lifecycle.

Error handling

To handle errors that may occur when sending the events via the Python SDK, you can declare a callback called on_error.

def on_error(error, events):
print("Error response:", error)
rudder_analytics.on_error = on_error
This callback only returns the errors that occur with the HTTP requests to the gateway. It will not return any errors that occur while sending data to your downstream destinations.

Some of the common request responses are listed in the following table:

StatusCode
OK200
Request neither has anonymousId nor userId400
Invalid write key401
Invalid JSON400

Contact us

For more information on the topics covered on this page, email us or start a conversation in our Slack community.

On this page