- Login to our workspace (named
Sourcegraph). If you don’t have an account, post in #it-tech-ops or request one during login.
- Watch this short walkthrough by @ericbm with a couple members of our marketing team. It goes over the data in Amplitude and how to build many different kinds of analyses. You can also read the Amplitude documentation for building analyses and/or check out some of our tutorials.
- Look at all the Team Spaces and join whichever ones are relevant to you. Learn more about Team Spaces here.
- Read the Sourcegraph <> Amplitude FAQs, Amplitude help center or Ask BizOps for help if you have any questions. As a reminder, you can always post in #analytics-review if you have a work-in-progress analysis you want someone else’s eyes on.
Amplitude is a product analytics tool that specializes in turning event data into actionable insights and dashboards. Any question about how our product is used is probably easiest to answer in Amplitude because they’re built specifically to answer these questions.
Looker is very flexible in that we have the ability to set it up for any purpose we’d like. The downside of this is that to provide a great self-service experience for questions about product usagee, this would take a LOT of work. We get these capabilities out of the box with Amplitude.
Anything not based directly on analyzing Sourcegraph Cloud events is in Looker. This includes pings from on-prem instances, anything we get from the Cloud Postgres database and any data from third-parties tools (such as Google Analytics and Salesforce).
|Type of analysis||Tool||Example|
|Adoption of public code||Looker||Link|
|Progress towards OKRs||Looker||Link|
|Signup funnel on Cloud||Amplitude||Link|
Any analysis that was conducted in Looker prior to the implementation of Amplitude can still be done in Looker, and existing Looker dashboards and visualizations will still be maintained. Amplitude will help us conduct new and different analysis regarding product analytics.
Most Sourcegraph Cloud events are being sent to Amplitude. The events not being sent are extremely low traffic events, non-UI events (e.g. backend events for a search that we capture) or ones we have explicitly decided to exclude (such as code insights events because we’re focused on enterprise and Cloud data won’t inform any decisions). The full data map is in Drive.
The data is currently backfilled from .
Event properties are attributes of a particular event. These are added to the
amplitude_events_v5 table as individual columns from this scheduled query. For example, here’s a snippet that extracts event properties from both the argument field of an event, as well as an event itself.
The steps to adding additional event properties to the query/table:
- Create a PR to update the
amplitude_events_v5scheduled query with the new event properties.
- Once another member of the team approves, open a blank BigQuery query editor, run the query, and then save the results to a new test table (e.g. amplitude_test_20210812).
- Create an issue for Data Engineering to upload a sample of the new test table to a test project in Amplitude.
- If the data shows up as expected in Amplitude, go ahead and update the
amplitude_events_v5query with your PR.
- Update the INSERT statement with the new event properties. You can’t backfill event properties to events that already exist in Amplitude; however, you can backfill event properties if you’re backfilling events also.
If you’re adding a new event property, please add it to the data map.
JSON_EXTRACT(argument, '$.code_search.results.results_count') AS results_count, JSON_EXTRACT(argument, '$.code_search.results.alert') AS alert, CASE WHEN name = 'ViewRepogroup:python' THEN 'python' WHEN name = 'ViewRepogroup:golang' THEN 'golang' WHEN name = 'ViewRepogroup:android' THEN 'android' WHEN name = 'ViewRepogroup:kubernetes' THEN 'kubernetes' WHEN name = 'ViewRepogroup:kubernetes' THEN 'kubernetes' WHEN name = 'ViewRepogroup:kubernetes' THEN 'kubernetes' WHEN name = 'ViewRepogroup:stanford' THEN 'stanford' WHEN name = 'ViewRepogroup:cncf' THEN 'cncf' ELSE NULL END repogroup_name,
User properties are the attributes of individual users. The
amplitude_user_characteristics table defines all user properties, and this table is joined in with every event that is passed to Amplitude. These user properties include every A/B test from
ab_test_users. See more in the A/B testing doc (coming soon).
If an additional user property is added to this table, it will only be applied to events and users going forward. User properties are sent to Amplitude along with events, so new properties will only be added when a user triggers an event that is sent to Amplitude. When adding a user property, please add it to the data map
Amplitude is built on top of our existing eventLogger infrastructure, so we consider an event to be anything logged by this. We use object action framework in Proper Case for our naming. For example, in eventLogger this would show up as ‘SearchSubmitted’, and in Amplitude it’s converted to ‘Search Submitted’.
All events from eventLogger are sent to Amplitude except if explicitly added to a denylist in the scheduled query.
Amplitude Govern allows us to manage our data within Amplitude itself (opposed to within data pipelines). The ways we currently use Govern are:
- Any new event is blocked from being visible in Amplitude until it’s adapted (if necessary) to fit the following Amplitude “rules” and then unblocked by someone on the DataOps team
- The event name is converted to object action framework in Proper Case
- The event is set to active/inactive to follow our Cloud active user definitions
- The event is grouped in the appropriate category (such as
Search and navigationor
- Events can be merged if they represent the same action. For example, we deprecated an event that represented a sign-up in favor of a new event. In Govern, we can merge these two events to continue tracking sign-up events over time.
Each A/B test has a user property where the A/B test is true or false (true = they saw the variant, false = they saw the original). See the page on A/B testing for more information about experimentation at Sourcegraph.
Most times you can search in the ‘Select event’ dropdown and you’ll find the event you’re looking for. If you can’t, or are unsure what the Amplitude event corresponds to, keep reading!
To figure out an event’s name is in code or to confirm whether an existing event in Amplitude is the correct one, follow these steps (or watch this video) :
To find what a button, page view or action is named
- Open Safari
- Enable developer mode (preferences -> advanced)
- Go to sourcegraph.com
localStorage.eventLogDebug = "true"
- Start using Sourcegraph and click/view the event you’re looking for!
There are three levels to where a chart can be saved.
- To create a private chart, uncheck ‘Make Discoverable’ in the chart’s dropdown (“More”).
- To create a chart that is discoverable by other teammates but not as easily accessible unless they search (or you send them a link), just create a chart. This is Amplitude’s default.
- Add charts to Team Spaces if you want them to be easily accessible to teammates.
Team Spaces are a collection of charts, dashboards and notebooks that are relevant to a specific group (Amplitude’s version of a folder system). By joining one, you’ll be notified of every new analysis that a teammate adds to the space (and they’ll be notified of yours, as well). You can see all of our Team Spaces here.