Telemetry¶

Added in version 0.5.0.

Platzky includes OpenTelemetry integration for distributed tracing and performance monitoring. This helps identify bottlenecks in your application, whether running on Google App Engine, Kubernetes, or any other platform.

Why Telemetry?¶

Telemetry gives you visibility into:

Request latency and throughput
Database query performance
External API call durations
Bottlenecks in your application code

This is especially valuable when running in production environments where traditional debugging isn’t available.

Installation¶

Telemetry support requires optional dependencies:

$ pip install platzky[telemetry]

Or with Poetry:

$ poetry install -E telemetry

This installs:

opentelemetry-api - Core OpenTelemetry API
opentelemetry-sdk - OpenTelemetry SDK
opentelemetry-instrumentation-flask - Automatic Flask instrumentation
opentelemetry-instrumentation-logging - Automatic logging instrumentation
opentelemetry-instrumentation-pymongo - Automatic MongoDB instrumentation
opentelemetry-instrumentation-requests - Automatic HTTP client instrumentation
opentelemetry-exporter-otlp - OTLP exporter for sending traces

Configuration¶

Enable telemetry in your config.yml:

TELEMETRY:
  enabled: true
  endpoint: http://localhost:4317

Configuration Options¶

`enabled`¶

Type:: bool
Default:: False

Enable or disable telemetry collection.

`endpoint`¶

Type:: str
Default:: None

OTLP gRPC endpoint for exporting traces. This endpoint should point to your OpenTelemetry collector or observability backend. Must be a valid http:// or https:// URL.

`console_export`¶

Type:: bool
Default:: False

Export traces to console for debugging purposes. Useful during development to see traces without setting up a collector.

`timeout`¶

Type:: int
Default:: 10

Timeout in seconds for the exporter. Must be greater than 0.

`deployment_environment`¶

Type:: str
Default:: None

Deployment environment name (e.g., “production”, “staging”, “dev”). This is added as a resource attribute to help filter traces by environment.

`service_instance_id`¶

Type:: str
Default:: Auto-generated (hostname-uuid)

Service instance identifier. If not provided, an ID is automatically generated using the hostname and a short UUID.

`instrument_logging`¶

Type:: bool
Default:: True

Enable automatic logging instrumentation. When enabled, trace context (trace ID, span ID) is automatically added to log records, allowing you to correlate logs with traces in your observability platform.

The trace context is added as attributes to log records without modifying your existing log format. You can access these attributes in custom log formatters:

otelTraceID - The trace ID for the current request
otelSpanID - The span ID for the current operation
otelServiceName - The service name

Example custom formatter that includes trace context:

import logging

# Define a custom formatter that includes trace context
formatter = logging.Formatter(
    '%(asctime)s - %(name)s - %(levelname)s - '
    '[trace_id=%(otelTraceID)s span_id=%(otelSpanID)s] - '
    '%(message)s'
)

# Apply to your handlers
handler = logging.StreamHandler()
handler.setFormatter(formatter)
logging.getLogger().addHandler(handler)

What Gets Traced?¶

When telemetry is enabled, Platzky automatically instruments:

Flask Requests¶

Every HTTP request is traced with:

Request method and path
Response status code
Request duration
Query parameters and headers (configurable)

MongoDB Queries¶

If you’re using MongoDB, all database operations are traced:

Query operations (find, insert, update, delete)
Database and collection names
Query duration

HTTP Requests¶

Outgoing HTTP requests made with the requests library are traced:

URL and method
Response status code
Request duration

Deployment Examples¶

Local Development with Jaeger¶

Run Jaeger locally with Docker:

$ docker run -d --name jaeger \
  -p 4317:4317 \
  -p 16686:16686 \
  jaegertracing/all-in-one:latest

Configure Platzky:

TELEMETRY:
  enabled: true
  endpoint: http://localhost:4317

View traces at http://localhost:16686

Kubernetes with Grafana Tempo¶

Deploy Grafana Tempo in your cluster, then configure:

TELEMETRY:
  enabled: true
  endpoint: http://tempo-distributor.monitoring.svc.cluster.local:4317

Google App Engine¶

Use Google Cloud’s OpenTelemetry collector:

Deploy the OpenTelemetry Collector to your GCP project
Configure Platzky:

TELEMETRY:
  enabled: true
  endpoint: http://opentelemetry-collector:4317

View traces in Google Cloud Trace console.

AWS with X-Ray¶

Use AWS Distro for OpenTelemetry:

TELEMETRY:
  enabled: true
  endpoint: http://localhost:4317  # ADOT collector

View traces in AWS X-Ray console.

Analyzing Traces¶

Once telemetry is collecting data, you can:

Identify Slow Requests¶

Look for HTTP request spans with high duration. The trace will show you:

Which route is slow
What’s causing the slowness (database query, external API, etc.)

Optimize Database Queries¶

MongoDB query spans show:

Query duration
Which queries are most frequent
N+1 query patterns

Find External API Bottlenecks¶

HTTP client spans reveal:

Which external APIs are slow
Timeout issues
Rate limiting problems

Best Practices¶

Development vs Production¶

Consider disabling telemetry in development to reduce noise:

# config-dev.yml
TELEMETRY:
  enabled: false

# config-prod.yml
TELEMETRY:
  enabled: true
  endpoint: http://tempo-collector:4317

Sampling¶

In high-traffic applications, consider configuring sampling at the collector level to reduce overhead and costs. Most observability platforms support trace sampling.

Privacy Considerations¶

Be aware that traces may contain:

Request URLs (which might include sensitive parameters)
Database query details
Response data

Configure your instrumentation appropriately for your privacy requirements.

Troubleshooting¶

No Traces Appearing¶

Verify telemetry dependencies are installed:
```
$ pip list | grep opentelemetry
```
Check the OTLP endpoint is reachable:
```
$ telnet tempo-collector 4317
```
Look for OpenTelemetry warnings in application logs

High Overhead¶

If telemetry is causing performance issues:

Verify you’re using an async exporter (OTLP uses async by default)
Configure sampling at the collector level
Check network latency to your OTLP endpoint

Platzky

Navigation

Related Topics

Telemetry¶

Why Telemetry?¶

Installation¶

Configuration¶

Configuration Options¶

`enabled`¶

`endpoint`¶

`console_export`¶

`timeout`¶

`deployment_environment`¶

`service_instance_id`¶

`instrument_logging`¶

What Gets Traced?¶

Flask Requests¶

MongoDB Queries¶

HTTP Requests¶

Deployment Examples¶

Local Development with Jaeger¶

Kubernetes with Grafana Tempo¶

Google App Engine¶

AWS with X-Ray¶

Analyzing Traces¶

Identify Slow Requests¶

Optimize Database Queries¶

Find External API Bottlenecks¶

Best Practices¶

Development vs Production¶

Sampling¶

Privacy Considerations¶

Troubleshooting¶

No Traces Appearing¶

High Overhead¶

Further Reading¶

Telemetry¶

Why Telemetry?¶

Installation¶

Configuration¶

Configuration Options¶

enabled¶

endpoint¶

console_export¶

timeout¶

deployment_environment¶

service_instance_id¶

instrument_logging¶

What Gets Traced?¶

Flask Requests¶

MongoDB Queries¶

HTTP Requests¶

Deployment Examples¶

Local Development with Jaeger¶

Kubernetes with Grafana Tempo¶

Google App Engine¶

AWS with X-Ray¶

Analyzing Traces¶

Identify Slow Requests¶

Optimize Database Queries¶

Find External API Bottlenecks¶

Best Practices¶

Development vs Production¶

Sampling¶

Privacy Considerations¶

Troubleshooting¶

No Traces Appearing¶

High Overhead¶

Further Reading¶

`enabled`¶

`endpoint`¶

`console_export`¶

`timeout`¶

`deployment_environment`¶

`service_instance_id`¶

`instrument_logging`¶