Analytics Open Tasks

Help.

114124 --- DISCUSSED BELOW --- Backlog 2016-04-02 2017-07-13
130651 EventStreams Backlog 2016-04-02 2017-07-10
114675 Sanitize pageview_hourly - subtasked {mole} Backlog 2016-04-02 2016-04-21
130256 Wikistats 2.0. Backlog 2016-04-02 2017-07-12
132256 Analytics hosts showed high temperature alarms Backlog 2016-04-10 2017-06-23
135251 Investigate requests flagged as pageview in analytics header coming from bots Backlog 2016-05-14 2017-06-15
143927 Global Unique Devices Counts Backlog 2016-08-26 2017-05-05
143924 Replacing standard edit metrics in dashiki with data from new edit data depot Backlog 2016-08-26 2016-12-21
152015 Provision new Kafka cluster(s) with security features Backlog 2016-12-01 2017-07-17
152712 Replacement of stat1002 and stat1003 Backlog 2016-12-09 2017-07-17
153923 vet metrics calculated from the data lake Backlog 2016-12-22 2017-05-25
155497 Measure Community Backlog. Backlog 2017-01-18 2017-07-17
156384 Backend for wikistats 2.0 Backlog 2017-01-27 2017-07-17
156841 Hadoop: Add a lower priority queue: nice queue Backlog 2017-02-01 2017-07-14
156933 Improve purging for analytics-slave data on Eventlogging Backlog 2017-02-02 2017-07-11
157435 Review ACLs for the Analytics VLAN Backlog 2017-02-08 2017-06-27
154090 Add "Interwicket" to the list of bots Backlog 2017-02-14 2017-03-01
153029 EventBus logs don't show up in logstash Backlog 2017-02-24 2017-07-13
159233 Create dashboard for upload wizard Backlog 2017-03-01 2017-07-03
160370 Initial Launch of new Wikistats 2.0 website Backlog 2017-03-14 2017-05-02
160748 Create /v2/schema/:schema_uri endpoint for eventstreams that proxies schemas from eventbus Backlog 2017-03-18 2017-06-22
161147 Provide cumulative edit count in Data Lake edit data Backlog 2017-03-23 2017-07-13
161326 Collaborate with zero on asiacell report Backlog 2017-03-25 2017-06-28
161824 Add zero carrier to pageview_hourly data on druid Backlog 2017-03-31 2017-06-01
161896 Add time_to_user_next_edit and time_to_page_next_edit in Mediawiki Denormalized History Backlog 2017-04-01 2017-07-12
162034 Create purging script for mediawiki-history data Backlog 2017-04-04 2017-06-27
162365 ExternalLinksChange Logging instrumentation is completely broken Backlog 2017-04-07 2017-05-05
162610 Implement EventLogging Hive refinement Backlog 2017-04-11 2017-06-07
162817 Design document for wikistats prototype backend Backlog 2017-04-13 2017-05-02
163229 Review by legal department of text on wikistats site Backlog 2017-04-19 2017-07-17
163327 Add monthly unique devices dataset to pivot Backlog 2017-04-20 2017-04-19
163697 Check how pivot updates schema (or maybe make schema explicit on pivot) Backlog 2017-04-25 2017-04-28
163817 Implement pageview metric in Wikistats UI Backlog 2017-04-26 2017-06-12
164021 Create tagging udf Backlog 2017-04-28 2017-07-06
164020 Use hive dynamic partitioning to split webrequest on tags Backlog 2017-04-28 2017-07-14
164019 Webrequest tagging and distribution. Measuring non-pageview requests Backlog 2017-04-28 2017-06-27
164194 Unique Devices on Pivot, initial screen should not add values by default, is this configurable? Backlog 2017-05-02 2017-05-08
164243 Alarms on pageview API latency increase Backlog 2017-05-03 2017-06-08
164409 EventLogging tests fail for python 3.4 in Jenkins Backlog 2017-05-04 2017-07-17
164497 Cleaning scheme for banner data _SUCCESS files Backlog 2017-05-05 2017-05-11
94596 Make oozie work with spark jobs that use HiveContext Backlog 2017-05-06 2017-07-11
165366 rack/setup/install replacement stat1006 (stat1003 replacement) Backlog 2017-05-16 2017-07-20
165368 rack/setup/install replacement to stat1005 (stat1002 replacement) Backlog 2017-05-16 2017-07-19
166162 Update puppet for new Kafka cluster and version Backlog 2017-05-24 2017-07-11
166167 Write generic certificate management software for use with Puppet and Self Signing CAs. Backlog 2017-05-24 2017-07-11
167033 https://dumps.wikimedia.org/other/pageviews/ needs a README Backlog 2017-06-06 2017-06-15
167304 Understand Kafka ACLs and figure out what ACLs we want for production topics Backlog 2017-06-08 2017-07-19
166335 Warning: JsonConfig: Invalid $wgJsonConfigModels['JsonConfig.Dashiki'] array value, 'class' not found Backlog 2017-06-09 2017-07-20
167539 Final steps to expose project wide unique devices data Backlog 2017-06-10 2017-06-15
167494 Some fields in Pivot should be numbers Backlog 2017-06-10 2017-06-23
167673 Address design feedback from Volker Backlog 2017-06-13 2017-06-12
167676 Implement Topic Selector Widget Backlog 2017-06-13 2017-06-12
167674 Re-read Round 2 feedback on mediawiki and make any critical items into tasks Backlog 2017-06-13 2017-06-12
167672 Routing Backlog 2017-06-13 2017-06-12
168303 dbstore1002 /srv filling up Backlog 2017-06-20 2017-07-18
168415 Make refinery drop data scripts email analytics-alerts if they fail Backlog 2017-06-21 2017-06-26
168414 Purge all old data from EventLogging master Backlog 2017-06-21 2017-06-23
168477 Update html language for per-domain uniques Backlog 2017-06-21 2017-06-20
168874 Add normalized_host.project_family and deprecate and remove normalized_host.project_class Backlog 2017-06-27 2017-07-14
168927 Smartctl errors for one kafka1012 disk Backlog 2017-06-28 2017-07-19
169101 Make banner realtime jobs more resilient Backlog 2017-06-29 2017-06-29
169248 Adding MAILTO to crontab for camus job Backlog 2017-06-30 2017-06-30
169245 Document revision-create event for EventStreams Backlog 2017-06-30 2017-07-17
169550 Final Vetting of Family Wide unique devices data Backlog 2017-07-04 2017-07-12
169572 Push mediawiki history data into labs. Public history data lake Backlog 2017-07-04 2017-07-06
170157 decommission rcs100[12] Backlog 2017-07-11 2017-07-19
170463 Addition of (mock) Active Editors metric Backlog 2017-07-13 2017-07-17
170461 Addition of Unique Devices metric Backlog 2017-07-13 2017-07-17
170486 ChangesListHighlights events missing from MySQL starting 2017-07-11 Backlog 2017-07-13 2017-07-18
170459 Cleanup Routing code Backlog 2017-07-13 2017-07-20
170457 Define, Document (and test) Desktop and Mobile browser support for wikistats 2.0 Backlog 2017-07-13 2017-07-19
170496 Firewalls appear to be preventing spark executors from talking to spark driver on stat1005 Backlog 2017-07-13 2017-07-17
170493 Mediawiki History Druid indexing failed Backlog 2017-07-13 2017-07-19
170471 Move statistics::discovery jobs from stat1002 -> stat1005 Backlog 2017-07-13 2017-07-13
170458 Set up continuous integration for wikistats 2.0 UI Backlog 2017-07-13 2017-07-18
170460 Wikistats 2.0 UI second deployment/iteration Backlog 2017-07-13 2017-07-18
170522 deployment-eventlogging03 out of disk space Backlog 2017-07-14 2017-07-20
170602 License for pageview data Backlog 2017-07-14 2017-07-21
170590 Upgrade Druid to 0.9.2 as a temporary measure Backlog 2017-07-14 2017-07-13
170720 Clean up PageContentSaveComplete event if there are no data users Backlog 2017-07-15 2017-07-17
170764 Migrate table creation query to oozie Backlog 2017-07-17 2017-07-17
170878 Audit users and account expiry dates for stat boxes Backlog 2017-07-18 2017-07-18
170882 Implement some example metrics as Druid queries Backlog 2017-07-18 2017-07-17
170845 Pageview drop in ro.wikipedia hu.wikipedia and fr.wikipedia Backlog 2017-07-18 2017-07-17
170925 Ensure indexes are added to `meta_dt` and unique `meta_id` fields in eventbus MySQL tables in eventlogging databases. Backlog 2017-07-19 2017-07-19
170952 Inconsistent default charset for analytics slaves Backlog 2017-07-19 2017-07-19
111611 Accented letters seem to be rejected in cohort names In Progress 2016-04-02 2016-08-22
116658 Add Application errors for Mediawiki API to x-analytics In Progress 2016-04-02 2017-01-17
116188 Add help page in wikitech on what the analytics team can do for you similar to release engineering page In Progress 2016-04-02 2017-02-09
124319 Add json linting test for schemas in mediawiki/event-schemas In Progress 2016-04-02 2016-05-02
88775 Add mediacounts to pageview API In Progress 2016-04-02 2017-07-03
87604 Add ops-reportcard dashboard with analysis that shows the http to https slowdown on russian wikipedia In Progress 2016-04-02 2017-03-01
92875 Add page_id and namespace to X-Analytics header in App / api requests In Progress 2016-04-02 2017-05-25
126279 Add pivot parameter to tabular layout graphs {lama} In Progress 2016-04-02 2017-04-26
110459 Allow clicking on links in annotations In Progress 2016-04-02 2017-02-09
108757 Allow opting out from logging some of the default EventLogging fields on a schema-by-schema basis In Progress 2016-04-02 2017-04-26
89726 Analyze difference in Edit Schema "bounce rates" across wikis {lion} In Progress 2016-04-02 2016-09-26
124082 As an end-user I shouldn't see non-articles in the list of top articles In Progress 2016-04-02 2017-05-16
118842 Backfill pageview_hourly sanitization - 1 month - {hawk} - DUPLICATE THIS TASK FOR EACH MONTH TO BACKFILL In Progress 2016-04-02 2017-05-16
121912 Better redirect handling for pageview API In Progress 2016-04-02 2017-07-03
120330 Bot to call global metrics to event page {kudu} In Progress 2016-04-02 2016-01-12
131127 Break down "Other" a little more? In Progress 2016-04-02 2017-05-16
130832 cdh::hadoop::directory (and other hdfs puppet command?) should quickly check if namenode is active before executing In Progress 2016-04-02 2017-04-26
121286 Central repository of global metrics reports {kudu} In Progress 2016-04-02 2016-08-22
113695 Clean the code review queue of analytics/wikistats In Progress 2016-04-02 2016-02-29
113817 Connect Hadoop records of the same request coming via different channels In Progress 2016-04-02 2016-10-04
123958 Consider scrapping Schema:PageContentSaveComplete and Schema:NewEditorEdit, given we have Schema:Edit In Progress 2016-04-02 2016-02-16
90240 Could it be that the geo IP matching is not accurate for Africa? In Progress 2016-04-02 2016-08-15
116678 Country mapping routine for proxied requests In Progress 2016-04-02 2017-07-20
117290 Create a tool that can read the elements of a wiki template and call the API {kudu} In Progress 2016-04-02 2016-02-18
119897 Create cron on 1002 to remove CirrusSearchRequest partitions In Progress 2016-04-02 2016-01-12
90759 Create Daily & Monthly pageview dump with country data In Progress 2016-04-02 2017-07-13
112284 Create new table for 'referer' aggregated data In Progress 2016-04-02 2017-04-26
95456 "Create Report" button does not appear when uploading a new cohort In Progress 2016-04-02 2016-09-19
127995 Create table in hive with a continent lookup for countries In Progress 2016-04-02 2017-04-26
92502 Dashboard Directory Design Feedback In Progress 2016-04-02 2017-05-16
120713 delete useless wikimetrics.report or cohort records In Progress 2016-04-02 2017-02-09
118841 Deploy pageview sanitization and start ongoing process {hawk} In Progress 2016-04-02 2017-05-16
131158 Describe threat model for sanitized pageview data {mole} In Progress 2016-04-02 2017-07-17
112846 Display automata and humans separately on zero results rate graph In Progress 2016-04-02 2016-01-12
121262 Display global metrics report results on same page as report inputs {kudu} In Progress 2016-04-02 2016-08-22
121561 Encrypt Kafka traffic, and restrict access via ACLs In Progress 2016-04-02 2017-07-11
125394 Ensure that EventBus extension gracefully handles service failures In Progress 2016-04-02 2016-10-12
117221 [Epic] Update official Wikimedia press kit with accurate numbers In Progress 2016-04-02 2017-07-10
121136 Establish a process to periodically review and approve access for hadoop/hue users In Progress 2016-04-02 2016-08-22
119094 Expose pageview data in each project's REST API In Progress 2016-04-02 2017-07-12
118310 Expose the results of the global metric at a public link, that's available immediately for the API {kudu} In Progress 2016-04-02 2016-03-14
116578 Fix layout of the daily email that sends pageview dataset status In Progress 2016-04-02 2017-04-26
117018 Fix the pageview API "top" spec and 404 reporting {slug} In Progress 2016-04-02 2017-04-17
89453 Hand off of Christian's MaxMind geolocation databases repository In Progress 2016-04-02 2017-07-03
119996 Have dashiki read and write GET params to pass stateful versions of dashboard pages {crow} In Progress 2016-04-02 2017-04-26
98831 Honor DNT header for access logs & varnish logs In Progress 2016-04-02 2017-05-16
115119 Implement Schema:ExternalLinksChange In Progress 2016-04-02 2017-07-17
127850 Improve Hue user management In Progress 2016-04-02 2017-07-03
117945 Investigate anomalous views to pages with replacement characters In Progress 2016-04-02 2017-07-13
89397 Investigate getting redirect_page_id as an x_analytics field using the X analytics extension. {pika} In Progress 2016-04-02 2017-07-03
114469 Investigate US traffic by state normalized by population In Progress 2016-04-02 2017-07-03
115634 --- Items above are triaged ----------------------- In Progress 2016-04-02 2017-07-06
108414 Load API request count and latency data from Hadoop to a dashboard In Progress 2016-04-02 2017-06-07
126501 Load Avro schemas from configurable external path In Progress 2016-04-02 2017-04-26
126494 Look into encrypting logs sent between mediawiki app servers and kafka In Progress 2016-04-02 2017-04-26
118402 Make AQS return 0 instead of no values {slug} In Progress 2016-04-02 2017-04-17
115042 Make banner impression counts available somewhere public In Progress 2016-04-02 2017-07-17
114162 Make sunburst and stacked-bars resize with window {crow} In Progress 2016-04-02 2016-03-14
110903 Make webperf eventlogging consumers use eventlogging on Kafka In Progress 2016-04-02 2017-03-09
131280 Making geowiki data public In Progress 2016-04-02 2017-07-17
125345 Many error 500 from pageviews API "Error in Cassandra table storage backend" In Progress 2016-04-02 2016-12-08
102079 Metrics about the use of the Wikimedia web APIs In Progress 2016-04-02 2017-07-20
127571 Percentage of users with DNT on In Progress 2016-04-02 2016-09-19
105815 Pipeline for data-intensive applications from research to productization to integration In Progress 2016-04-02 2016-01-12
118839 Productionize Pageview_sanitization hive code with Oozie job and refinery inclusion {hawk} In Progress 2016-04-02 2017-05-16
126290 Provide API for sampling pageviews In Progress 2016-04-02 2017-04-26
117480 Provide machine readable directory indexes on http://datasets.wikimedia.org/aggregate-datasets/ In Progress 2016-04-02 2016-01-12
128623 Purge all Schema:Echo data after 90 days In Progress 2016-04-02 2016-03-07
112009 Re-baselining checkpoints periodically In Progress 2016-04-02 2016-01-12
116387 Refactor webrequest_source partitions and oozie jobs In Progress 2016-04-02 2017-04-26
126218 Replicate wikitech wikis to analytics-store.eqiad.wmnet In Progress 2016-04-02 2016-02-11
103726 Report page views for labs instances In Progress 2016-04-02 2016-09-19
125015 Requests to (hard) redirect pages return their target's contents but are counted as pageviews to the redirect page In Progress 2016-04-02 2016-01-28
122245 REST API entry point web request statistics at the Varnish level In Progress 2016-04-02 2017-07-11
120852 Send burrow lag statistics to statsd/graphite {hawk} In Progress 2016-04-02 2017-06-15
108850 Set up auto-purging after 90 days {tick} In Progress 2016-04-02 2017-02-01
114884 Some special characters break Wikimetrics' encoding {dove} In Progress 2016-04-02 2016-09-19
128374 Sort out analytics service dependency issues for cp* cache hosts In Progress 2016-04-02 2017-04-26
71145 Story: AnalyticsEng has editor_day table in labsdb In Progress 2016-04-02 2016-09-19
106034 Too few page views for June/July 2015 In Progress 2016-04-02 2016-01-28
117236 Track overall traffic, without any filtering, broken down into major categories, for internal use. In Progress 2016-04-02 2017-04-26
93217 Troubleshoot Wikimetrics RAE reports In Progress 2016-04-02 2016-09-19
113234 Update passport-mediawiki module URLs and documentation In Progress 2016-04-02 2017-03-16
114199 Upgrade eventlogging servers to Jessie In Progress 2016-04-02 2017-07-18
76914 User reads result of validation after creating a cohort In Progress 2016-04-02 2016-08-22
118772 Use scap3 to deploy eventlogging/eventlogging In Progress 2016-04-02 2017-03-08
76432 Validate JsonSchemaContent using MediaWIki core's handling In Progress 2016-04-02 2016-01-12
120036 Vital Signs: Please make the data for enwiki and other big wikis less sad, and not just be missing for most days In Progress 2016-04-02 2017-07-06
120037 Vital Signs: Please provide an "all languages" de-duplicated stream for the Community/Content groups of metrics In Progress 2016-04-02 2017-07-06
131782 Put data needed for edits metrics through Event Bus into HDFS In Progress 2016-04-05 2017-07-05
131938 Evaluate whether to rewrite varnishkafka in python In Progress 2016-04-07 2017-04-26
132691 Clean up property passing in dashiki In Progress 2016-04-15 2017-02-09
133407 fundraising-tech request: browser version breakdown by country In Progress 2016-04-23 2016-04-25
132405 Collect information about how we collect user statistics in one place In Progress 2016-04-26 2017-07-19
133575 Provide weekly top pageviews stats In Progress 2016-04-26 2017-04-26
134231 Wikipedia Clickstream dataset. Programmatic Access In Progress 2016-05-04 2017-07-17
134524 Pageview API: Limit (and document) size of data you can request In Progress 2016-05-06 2017-06-15
135174 Move contents of ee-dashboards to edit-analysis.wmflabs.org In Progress 2016-05-13 2017-03-22
135762 A/B Testing solid framework In Progress 2016-05-20 2016-12-06
135812 20160431 produces "end timestamp is invalid, must be a valid date in YYYYMMDD format" In Progress 2016-05-24 2016-05-23
136025 Simplify readiness checking by making a ready computed In Progress 2016-05-24 2017-02-09
136126 Better menu for metrics In Progress 2016-05-25 2017-02-09
136127 Breakdowns should be bookmarkeable In Progress 2016-05-25 2017-02-09
136125 Dashiki, Unique Devices and Pageview data breakdown doesn't work if any of the items are not available for the project In Progress 2016-05-25 2017-02-09
136732 Puppetize job that saves old versions of geoIP database In Progress 2016-06-02 2017-05-16
136858 Beeline does not print full stack traces when a query fails {hawk} In Progress 2016-06-03 2017-07-03
137321 Run ETL for wmf_raw.ActionApi into wmf.action_* aggregate tables In Progress 2016-06-09 2017-01-27
137454 Bot from an Azure cloud cluster is causing a false pageview spike (can we identify as bot?) In Progress 2016-06-10 2017-06-15
138396 Create ops dashboard with info like ipv6 traffic split In Progress 2016-06-23 2017-06-12
138426 Spike: Evaluate alternatives to varnishkafka: varnishevents In Progress 2016-06-23 2017-06-12
138505 Split opera mini in proxy or turbo mode In Progress 2016-06-24 2017-06-26
139019 statistics about edit conflicts according to page type In Progress 2016-07-01 2017-06-02
138207 [Open question] Improve bot identification at scale In Progress 2016-07-15 2017-06-15
141010 Adding top counts for wiki projects (ex: WikiProject:Medicine) to pageview API In Progress 2016-07-22 2017-06-15
141117 Report on Wikimedia's industry ranking In Progress 2016-07-23 2017-07-10
126281 [Regression] stats.wikipedia.org redirect no longer works ("Domain not served here") In Progress 2016-08-02 2016-08-08
141506 Suddenly outrageous higher pageviews for main pages In Progress 2016-08-03 2016-12-19
142073 Improve user management for AQS In Progress 2016-08-05 2017-07-06
142139 Top API user agents stats In Progress 2016-08-05 2017-07-13
142408 Better publishing of Annotations about Data Issues In Progress 2016-08-09 2017-05-16
142395 Improve initial load performance for dashiki dashboards In Progress 2016-08-09 2017-02-09
142535 Find out what happens to the old rows in the revision table In Progress 2016-08-10 2017-03-20
143689 Bookmarkable date filters for browser stats dashboard In Progress 2016-08-24 2017-03-20
143743 Set up the foundation for the ReviewStream feed In Progress 2016-08-24 2017-06-07
144299 Dashboards working on mobile In Progress 2016-08-31 2017-04-24
144637 Check eventbus Kafka cluster settings for reliability In Progress 2016-09-03 2017-05-15
144639 Propose metrics along with qualifiers for the press kit In Progress 2016-09-03 2017-03-07
144837 Investigate lowering "per-article" resolution data in AQS In Progress 2016-09-07 2017-04-26
144714 [REQUEST] Extract search queries from HTTP_REFERER field for a Wikibook In Progress 2016-09-07 2016-10-06
145164 Add fields needed by ERI to mediawiki.revision-create In Progress 2016-09-10 2017-05-24
145197 WMF pageview API (404 error) when requesting statitsics over around 1000 files on GLAMorgan In Progress 2016-09-10 2016-09-16
144100 Pageview dumps incorrectly formatted, looks like a result of possibly malicious activity In Progress 2016-09-13 2017-05-29
145828 passport-mediawiki-oauth doesn't support callback parameter In Progress 2016-09-16 2017-03-16
145935 Responses on pageview API should be lighter In Progress 2016-09-18 2017-05-16
146130 Inconsistent Cassandra disk load shown in metrics and nodetool status In Progress 2016-09-21 2017-04-25
146774 Add external link to tabs layout In Progress 2016-09-28 2017-02-09
146911 Quantify false positives when filtering for number of distinct user agents per page in top pages computation In Progress 2016-09-29 2017-06-15
147009 Create a Universal Layout for Dashiki for staging / testing config purposes In Progress 2016-09-30 2017-04-26
147196 EventLogging sees MobileWikiAppFindInPage parsing errors In Progress 2016-10-04 2016-11-09
147967 The WMF-Last-Access Set-Cookie header should follow RFC 2965 syntax rather than the pre-RFC Netscape format In Progress 2016-10-13 2016-10-17
148053 Switch to fetch away from jquery In Progress 2016-10-14 2017-04-25
148461 Bot Identification: Inconsistent data in #all-sites-by-os-and-browser for IE7 In Progress 2016-10-18 2017-05-16
148469 Just an idea: poly-graph In Progress 2016-10-18 2017-05-08
148656 Edit analysis dashboard Failures by User Type chart does not update correctly In Progress 2016-10-20 2017-07-17
148776 Screen cast for how to use Pivot (5 minutes) In Progress 2016-10-21 2017-06-05
148843 GPU upgrade for stats machine In Progress 2016-10-22 2017-07-11
149594 Delete stale topics from main Kafka clusters In Progress 2016-11-01 2017-07-11
149736 Bikeshed what events should be exposed in public EventStreams API In Progress 2016-11-02 2017-06-22
150028 RFC: Requirements for analytics stats processor In Progress 2016-11-05 2017-01-03
150343 Puppetize clickhouse In Progress 2016-11-10 2017-06-15
150483 Set up a fake Pageview API endpoint for the beta cluster In Progress 2016-11-11 2017-05-16
150439 Tests for swagger spec stream routes in EventStreams In Progress 2016-11-11 2017-07-13
150713 Provide filterable line graph for browser-family/browser-major In Progress 2016-11-15 2017-05-08
151211 kafka alarms audit In Progress 2016-11-22 2017-07-10
151904 User limits for stat machines. Limit space on /home dir and possibly /tmp In Progress 2016-11-30 2017-07-06
152222 Make aggregated MediaWiki Pingback data publicly available In Progress 2016-12-03 2017-05-02
152257 Report updater should support Graphite mapping plugins In Progress 2016-12-04 2017-07-03
152546 de-duplicate archive records matching revision records in mediawiki_history In Progress 2016-12-07 2017-04-26
152522 Prevent notebooks on spark to launch 2 pyspark instances instead of 1 In Progress 2016-12-07 2017-06-05
152731 Implement server side filtering (if we should) In Progress 2016-12-09 2017-06-22
6537 Categories created using Tamil words not recognised in stats In Progress 2016-12-23 2017-01-23
153702 Update active editor metrics to use consensus definition In Progress 2016-12-23 2017-01-23
155014 Import 2001 wikipedia data In Progress 2017-01-11 2017-05-08
155478 Copy cached API requests from raw webrequests table to ApiAction In Progress 2017-01-18 2017-07-13
155507 Meta-statistics on MediaWiki history reconstruction process In Progress 2017-01-18 2017-06-15
155804 log-events topic emitted in EventBus In Progress 2017-01-21 2017-02-16
58575 Browser and platform stats for logged-in vs. anon users for security and product support decisions In Progress 2017-01-24 2017-06-15
156037 Load cirrussearch data into druid In Progress 2017-01-24 2017-01-30
102476 RFC: Requirements for change propagation In Progress 2017-01-26 2017-01-30
156523 Investigate adding user-friendly testing functionality to Reportupdater In Progress 2017-01-28 2017-04-26
61832 Remove ad-hoc UA logging from existing schemas In Progress 2017-01-28 2017-03-13
156657 dashiki should execute tests on jenkins In Progress 2017-01-31 2017-02-09
154912 Is User-Agent data PII when associated with Action API requests? In Progress 2017-01-31 2017-07-20
156656 Review parent task for any potential pageview definition improvements In Progress 2017-01-31 2017-05-29
156844 Prep to decommission old dbstore hosts (db1046, db1047) In Progress 2017-02-01 2017-06-22
156965 Remove user_agent_map from pageview_hourly long term In Progress 2017-02-02 2017-05-16
157092 Support per-topic configuration in EventBus service In Progress 2017-02-04 2017-04-26
157697 Add error component to Dashiki In Progress 2017-02-10 2017-04-26
157705 Kafka mirror maker failures when kafka brokers are restarted In Progress 2017-02-10 2017-07-13
157978 Blog post about druid In Progress 2017-02-14 2017-06-05
157981 Serve global unique device counts externally In Progress 2017-02-14 2017-06-08
157977 upgrade druid and pivot In Progress 2017-02-14 2017-07-13
125829 Potentially decrease db1046's InnoDB buffer pool In Progress 2017-02-15 2017-02-16
158106 Update EventBus RCFeed config to use newly refactored settings In Progress 2017-02-15 2017-06-22
158166 Discuss labsdb visibility of rev_text_id and ar_comment In Progress 2017-02-16 2017-03-13
158334 Make Spark 2.1 easily available on new CDH5.10 cluster In Progress 2017-02-17 2017-07-13
157088 [EPIC] Develop a JobQueue backend based on EventBus In Progress 2017-02-18 2017-07-11
76782 security review of Wikimetrics {dove} In Progress 2017-02-19 2017-02-23
71462 Wikimetrics is not supporting mlwiki cohort In Progress 2017-02-19 2017-02-23
158896 Provide a spark job processing history and text to extract citations diffs In Progress 2017-02-24 2017-03-02
158972 productionize ClickStream dataset In Progress 2017-02-25 2017-07-17
159046 Track page views by page ID rather than title (handles moved pages) In Progress 2017-02-27 2017-07-17
159170 Find an alternative query interface for eventlogging on analytics cluster that can replace MariaDB In Progress 2017-02-28 2017-07-03
139487 Get 'sparklyr' working on stats1002 In Progress 2017-02-28 2017-04-26
159269 Clean up remaining Dashiki configs on meta In Progress 2017-03-01 2017-07-13
159264 Prototype counting of requests with real time (streaming data) In Progress 2017-03-01 2017-07-03
159337 Automate refinery jar cleanup In Progress 2017-03-02 2017-06-12
159584 Secure hue and other private data access sites with 2FA In Progress 2017-03-04 2017-05-16
159840 Alarm on data quality issues In Progress 2017-03-08 2017-07-17
159962 Spike: Spark 2.x as cluster default (working with oozie) In Progress 2017-03-09 2017-06-15
160311 Sort inconsistency in AQS timestamp behavior In Progress 2017-03-14 2017-07-13
160630 Please update Tulu Language(tcy)in Wikipedia Statistics. In Progress 2017-03-17 2017-06-26
160679 Test pushing EventLogging through the ELK stack In Progress 2017-03-17 2017-03-20
160822 Filter local IPs before checking for geo info In Progress 2017-03-19 2017-06-26
161027 Puppetize event schema topic configuration In Progress 2017-03-22 2017-07-10
161149 Provide edit tags in the Data Lake edit data In Progress 2017-03-23 2017-07-17
161146 Provide historical redirect flag in Data Lake edit data In Progress 2017-03-23 2017-07-17
161185 Pivot "MediaWiki history" data lake: Feature request for "Event Users" In Progress 2017-03-24 2017-07-13
161186 Pivot "MediaWiki history" data lake: Feature request for "Time" dimension to sp\lit by calendar month / quarter / year -- needs druid 0.10 In Progress 2017-03-24 2017-06-29
161635 Storage for banner history data In Progress 2017-03-29 2017-04-04
161731 Create reliable change stream for specific wiki In Progress 2017-03-30 2017-07-06
162308 Eventlogging client needs to support offline events In Progress 2017-04-06 2017-04-26
119772 Create dashboard showing MediaWiki tarball download statistics In Progress 2017-04-10 2017-04-10
162618 Measure portal pageviews In Progress 2017-04-11 2017-07-10
162770 SATA errors for stat1004 in the dmesg In Progress 2017-04-13 2017-07-21
162933 Endpoint for average view rate in Pageview API In Progress 2017-04-14 2017-05-08
44360 Provide regular cross-wiki reports on flagged revisions status In Progress 2017-04-16 2017-04-30
163113 Adding breakdowns to mw edit history reconstruction: wiki projects, categories (cohort) In Progress 2017-04-18 2017-07-17
163134 Reportupdater writes a README in the output folder In Progress 2017-04-18 2017-04-26
163252 Add templating support to reportupdater scripts In Progress 2017-04-19 2017-04-26
163177 can't compile numpy on stat1004 In Progress 2017-04-19 2017-06-26
163379 Create JobQueue implementation that posts to EventBus In Progress 2017-04-20 2017-07-18
163380 Support posting Jobs to EventBus simultaneously with normal job processing In Progress 2017-04-20 2017-07-12
163725 Enable nested on-wiki config pages in mediawiki-storage In Progress 2017-04-25 2017-05-04
163797 add a more friendly message to ladp authentication box for pivot In Progress 2017-04-26 2017-05-08
163789 Investigate whether we could calculate "hourly unique devices" In Progress 2017-04-26 2017-05-08
163933 Investigate oozie suspended workflows In Progress 2017-04-27 2017-05-04
163907 Monitor hdfs-balancer In Progress 2017-04-27 2017-05-18
164008 Update druid to latest release In Progress 2017-04-28 2017-05-25
164201 AQS unique devices api should report offset/underestimate separately In Progress 2017-05-02 2017-05-04
164259 Add VSL error counters to Varnishkafka stats In Progress 2017-05-03 2017-05-18
164280 Host API for token persistence dataset In Progress 2017-05-03 2017-05-04
164348 Investigate the use of local_quorum for AQS In Progress 2017-05-04 2017-07-13
164377 [Spike] Store article quality data inside hadoop and make AQS outputs a public API In Progress 2017-05-04 2017-06-07
164500 Add jobs for druid compaction for pageviews data set In Progress 2017-05-05 2017-05-04
164596 Provide uniques offset/underestimate breakdowns in AQS In Progress 2017-05-06 2017-07-17
164593 Provide unqiues estimate/offset breakdowns in AQS In Progress 2017-05-06 2017-05-25
165233 Data Lake edit data missing for many wikis In Progress 2017-05-14 2017-07-11
165309 Create small sample mediawiki-history table in MariaDB In Progress 2017-05-16 2017-06-08
165634 Add li: Wikibooks to Wikistats In Progress 2017-05-18 2017-05-25
165560 Artificial spike in offset of unique devices from November to February 6th on wikidata In Progress 2017-05-18 2017-06-26
165736 Update Varnishkafka to support TLS encryption/authentication In Progress 2017-05-20 2017-05-29
166249 Bulk/Batch event endpoint In Progress 2017-05-25 2017-06-08
166248 Upgrade Analytics Cluster to Java 8 In Progress 2017-05-25 2017-07-03
166339 Are watchlists dead? In Progress 2017-05-26 2017-05-29
166331 Pivot - Article Page Views In Progress 2017-05-26 2017-05-29
150106 Type collisions in log events causing indexing failures in ELK Elasticsearch In Progress 2017-05-26 2017-05-25
166414 Explore NavigationTiming by faceted properties - EventLogging refine In Progress 2017-05-27 2017-07-13
166341 SSDs for main Kafka clusters In Progress 2017-05-30 2017-05-29
166689 Provide top domain and data to truly test superset In Progress 2017-06-01 2017-07-03
166679 Reinstate a subset of reports removed from the reportcard until WikiStats 2.0 is back In Progress 2017-06-01 2017-06-08
166712 Remove logging from labs for schema https://meta.wikimedia.org/wiki/Schema:CommandInvocation In Progress 2017-06-01 2017-06-05
166832 Import Kafka messages into HDFS authenticating with TLS/SSL In Progress 2017-06-02 2017-07-11
166833 Produce webrequests from varnishkafka to Kafka with Kafka message timestamp set to configurable content field In Progress 2017-06-02 2017-07-11
166937 Broken /a/refinery-source/guard/run_all_guards.sh script on stat1002 In Progress 2017-06-04 2017-07-10
166877 Bring the Editor Engagement Dashboard back In Progress 2017-06-06 2017-06-05
167039 Upgrade Kafka on main cluster with security features In Progress 2017-06-06 2017-06-05
167180 Emit revision-score event to EventBus and expose in EventStreams In Progress 2017-06-07 2017-07-11
167290 Sqoop wbc_entity_usage from all wikis into hadoop (HDFS) In Progress 2017-06-08 2017-06-12
167427 Configure superset to query mysql slaves In Progress 2017-06-09 2017-06-08
167608 Add caused_by_user_text to mediawiki_page_history In Progress 2017-06-12 2017-06-12
143819 Data request for logs from SparQL interface at query.wikidata.org In Progress 2017-06-13 2017-07-05
167790 Refactor puppet code for the Hadoop Analytics cluster to roles/profiles In Progress 2017-06-14 2017-07-21
163233 Implement Varnish-level rough ratelimiting In Progress 2017-06-15 2017-07-17
167907 Incorporate data from the GeoIP2 ISP database to webrequest In Progress 2017-06-15 2017-06-26
167992 rack/setup/install new kafka nodes kafka-jumbo100[1-6] In Progress 2017-06-16 2017-07-20
107250 Remove links in wikistats to minnan.wikipedia.org In Progress 2017-06-16 2017-06-15
168390 Alarm on HDFS related script failures In Progress 2017-06-21 2017-07-06
168573 Dashiki Cleanup In Progress 2017-06-22 2017-06-21
168554 Default hive table creation to parquet - needs hive 2.3.0 In Progress 2017-06-22 2017-06-29
168538 Perf test RAID vs JBOD with new hardware and kafka versions In Progress 2017-06-22 2017-07-19
168550 Productionize Tranquility (or shut it off) In Progress 2017-06-22 2017-07-10
168648 Productionize analysis of editcount vs per_user_revision_count In Progress 2017-06-23 2017-06-26
169270 Site for Wikimedia Analytics lacks clear license In Progress 2017-07-04 2017-07-06
169672 Bucketize userEditCount in EL instrumentation In Progress 2017-07-05 2017-07-06
169674 Fix EventLogging editCountBucket fields historically In Progress 2017-07-05 2017-07-06
153033 Drop MoodBar tables from all wikis In Progress 2017-07-06 2017-07-06
169900 ---------------- Discussed above -------------------- In Progress 2017-07-07 2017-07-10
170019 Script that synchronizes EL purging white-list with schema talk pages In Progress 2017-07-08 2017-07-10
170145 Add parsedcomment to recentchange stream In Progress 2017-07-11 2017-07-20
170162 hdfs password file for mysql should be re-generated when the password file is changed by puppet In Progress 2017-07-11 2017-07-10
89887 Clean up permissions for privatedata files on stat1002 - they should be group readable by statistics-privatedata-users In Progress 2017-07-12 2017-07-13
170429 Deploy Wikistats and analytics.wikimedia.org via SCAP In Progress 2017-07-13 2017-07-13
170606 Add Accept header to webrequest logs In Progress 2017-07-14 2017-07-17
170620 Alarm on errors on /var/log/upstart/eventlogging* files In Progress 2017-07-14 2017-07-17
170864 Delete eventlogging alerts e-mail list In Progress 2017-07-18 2017-07-18
170826 Enable base::firewall on stat boxes after restricting Spark REPL ports. In Progress 2017-07-18 2017-07-17
170790 Upgrade AQS to node 6.11 In Progress 2017-07-18 2017-07-17
170850 Visualize page create events for all wikis In Progress 2017-07-18 2017-07-18
170990 Add index to mediawiki_page_create_1 table In Progress 2017-07-19 2017-07-19
170933 Wikistats2 bugs (1/4) - Dashboard and general UI In Progress 2017-07-19 2017-07-18
170936 Wikistats2 bugs (2/4) - Wiki selector In Progress 2017-07-19 2017-07-18
170937 Wikistats2 bugs (3/4) - Data issues In Progress 2017-07-19 2017-07-18
170940 Wikistats2 bugs (4/4) - Detail page In Progress 2017-07-19 2017-07-18
171099 CamusPartitionChecker does not work when topic names have '.' or '-' in them. In Progress 2017-07-20 2017-07-19
171048 Eventbus does not handle gracefully changes in DNS recursors In Progress 2017-07-20 2017-07-20
171117 Track zim file downloads In Progress 2017-07-20 2017-07-20
171011 Update EventStreams to service-template-node v0.5.2 In Progress 2017-07-20 2017-07-19
170650 [Investigate] Hadoop integration for ORES training In Progress 2017-07-21 2017-07-20
171203 Run eventlogging purging script on beta labs to avoid disk getting full In Progress 2017-07-21 2017-07-20