TechnicalOperations Status Report

Project Operations from 2018-07-01 to 2018-10-01


Network Operations 205340 analytics1-a VLAN has no DNS for gateway addresses to match other analytics VLANs Screep Done None
Network Operations 201149 cr1/2-eqiad PFE_FW_SYSLOG_IP6_GEN log entries Screep Done None
Network Operations 204730 Enable cumin2001 in router ACLs Screep Done None
Network Operations 205513 Enable cumin1001 in router ACLs Screep Done None
Network Operations 201095 asw2-a-eqiad VC link down Screep Done None
Network Operations 185171 replace mr1-eqiad In-Scope Done None
Network Operations 199341 New PFW policy for Amazon Screep Done None
Network Operations 196030 troubleshoot cr3/cr4 link In-Scope Done None
Network Operations 195365 cp intermittent IPsec MTU issue In-Scope Done None
Network Operations 199530 Rename of wasat to mwmaint2001 (switch labels et al) Elaborated Done None
Network Operations 189519 Audit switch ports/descriptions/enable In-Scope Done None
Network Operations 196941 Rack/setup cr2-eqdfw In-Scope Done None
Network Operations 183390 unrack/decom pfw1-eqiad and pfw2-eqiad In-Scope Done None
Network Operations 199779 Update core routers routing for labtest Cloud VPS deployment Screep Done None
Network Operations 202520 set up NAT from 208.80.155.15 to frpig1001 Screep Done None
Network Operations 181036 Pull netflow data in realtime from Kafka via Tranquility/Spark In-Scope Done None
Network Operations 198623 Review analytics-in4/6 rules on cr1/cr2 eqiad Screep Done 21.0
Network Operations 199821 connect eth1 on labtestnet2002 and labtestnet2003 Screep Done None
Network Operations 205574 deploy new pfw config Screep Done None
Network Operations 202952 rancid pubkey auth to Junos 17.4 failure Screep Done None
Network Operations 202536 adjust NAT for 208.80.152.231 (codfw bastion) to point to frbast2001 (10.195.0.67) Screep Done None
Network Operations 201694 Move servers off asw2-a-eqiad Screep Done None
Network Operations 190424 modify labs-hosts1-vlans for http load of installer kernel In-Scope Done None
Network Operations 122210 Security audit for tftp on install1001 In-Scope Done None
Network Operations 203719 Interface errors on cr2-eqiad:xe-4/0/0 Screep Done None
Network Operations 199832 Unexpected network packets in codfw mgmt Screep Done None
Network Operations 201414 Use dns100[12] as ntp servers in eqiad networking equipment Elaborated Done None
Network Operations 204743 Ensure scs-c1-eqiad:eth1 is not connected Screep Done None
Network Operations 185926 review and fix scs config In-Scope Done None
Network Operations 202075 Move asw2-a<->cr1 uplink back to asw-a Screep Done None
Network Operations 202846 Update ACLs for newer graphite hosts Elaborated Done None
Network Operations 198516 NAT and DNS for fundraising monitor host In-Scope Done None
Network Operations 98006 Anycast (Auth)DNS In-Scope Open None
Network Operations 200277 OSPF metrics Screep Open None
Network Operations 184067 Complete router migration from cr1-esams to cr3-esams In-Scope Open None
Network Operations 204271 Grow frack-administration-codfw to /28 Screep Open None
Network Operations 204170 Rack/setup cr2-eqord Screep Open None
Network Operations 189689 Connection timeout from 195.77.175.64/29 to text-lb.esams.wikimedia.org In-Scope Open None
Network Operations 193496 Allocate public v4 IPs for Neutron setup in eqiad In-Scope Open None
Network Operations 187960 Rack/cable/configure asw2-a-eqiad switch stack In-Scope Open None
Network Operations 196432 Configure interface damping on primary links In-Scope Open None
Network Operations 196489 upgrade all codfw switch stacks to include additional 10G switch per row In-Scope Open None
Network Operations 173698 Backfill librenms data in graphite with historical RRDs In-Scope Open None
Network Operations 183585 Rack/cable/configure asw2-b-eqiad switch stack In-Scope Open None
Network Operations 83992 Juniper monitoring In-Scope Open None
Network Operations 186021 reconfigure esams switch port for new bastion In-Scope Open None
Network Operations 186550 Anycast recdns In-Scope Open None
Network Operations 190090 Offload pings to dedicated server In-Scope Open None
Network Operations 190364 eqiad 10G ports needs In-Scope Open None
Network Operations 201039 connectivity issues between several hosts on asw2-b-eqiad Screep Open None
Network Operations 196487 upgrade row d to have 3 10G switches In-Scope Open None
Network Operations 106056 set up a looking glass for WMF ASes In-Scope Open None
Network Operations 163674 Frequent RST returned by appservers to LVS hosts In-Scope Open None
Network Operations 136671 Intermittent bandwidth issue to labs proxy (eqiad) from Comcast in Portland OR In-Scope Open None
Network Operations 167841 Cleanup confed BGP peerings and policies In-Scope Open None
Network Operations 201444 Refresh switch ports descriptions for recently renamed cloud servers Screep Open None
Network Operations 174616 set up cr3-esams In-Scope Open None
Network Operations 201097 Add virtual chassis port status alerting Screep Open None
Network Operations 171032 Investigate lvs IP pages during codfw row C switch upgrade In-Scope Open None
Network Operations 180179 Evaluate the possibility to add Juniper images to Openstack In-Scope Open None
Network Operations 203261 cr2-eqdfw (MX204) vhclient log noise Screep Open None
Network Operations 197147 Rack/Setup new codfw QFX5100 10G switch In-Scope Open None
Network Operations 174596 dmz_cidr only includes some wikimedia public IP ranges, leading to some very strange behaviour In-Scope Open None
Network Operations 189522 Detect IP address collisions In-Scope Open None
Network Operations 86541 setup wifi in codfw In-Scope Open None
Network Operations 204782 cr2-ulsfo crash Screep Open None
Network Operations 170144 Evaluate NetBox as a Racktables replacement & IPAM In-Scope Open None
Network Operations 172459 eqiad row D switch upgrade In-Scope Open None
Network Operations 199142 Increase network capacity (2018-19 Q1 Goal) Screep Open None
Network Operations 187929 Cloud IPv6 subnets In-Scope Open None
Network Operations 191667 Juniper HA audit In-Scope Open None
Network Operations 185337 rack spare switches in c1-eqiad In-Scope Open None
Network Operations 196946 switch port configuration for lvs200[7-10] In-Scope Open None
Network Operations 204281 Stop prioritizing peering over transit Screep Open None
Network Operations 167842 Find a new PIM RP IP In-Scope Open None
Network Operations 122406 Consider renumbering Labs to separate address spaces In-Scope Open None
Network Operations 124843 Peer with SFMIX at ULSFO in 200 Paul In-Scope Open None
Network Operations 167691 High amount of unexpected ICMP dest unreachable toward esams cache clusters In-Scope Open None
Network Operations 82038 create a test for multicast relay In-Scope Open None
Network Operations 201145 asw2-a-eqiad FPC5 gets disconnected every 10 minutes Screep Open None
Network Operations 133387 Enabling IGMP snooping on QFX switches breaks IPv6 (HTCP purges flood across codfw) In-Scope Open None
Network Operations 150264 Icinga check for VRRP In-Scope Open None
Network Operations 185151 replace msw1-esams In-Scope Open None
Network Operations 189552 Rack/cable/configure ulsfo MX204 In-Scope Open None
Network Operations 167306 ospf link-protection In-Scope Open None
Network Operations 174637 Setup esams atlas anchor In-Scope Open None
Network Operations 187962 Rack/cable/configure asw2-c-eqiad switch stack In-Scope Open None
Network Operations 196557 switch port configuration for frmon2001 In-Scope Open None
Network Operations 201139 Intermittent connectivity issues in eqiad's row C Screep Open None
Traffic 200178 Traffic Server packaging and initial puppetization Screep Done None
Traffic 196691 rack/setup/install dns100[12].wikimedia.org In-Scope Done None
Traffic 201174 cp1080 uncorrectable DIMM error slot A5 Elaborated Done None
Traffic 202682 Improve Accept header normalization in VCL for REST API Screep Done None
Traffic 124418 Investigate massive increase in htmlCacheUpdate jobs in Dec/Jan In-Scope Done None
Traffic 199525 Investigate NXDOMAIN DNS responses in our authdns servers Screep Done None
Traffic 180329 Add CI to all operations/software/varnish/* repositories and archive obsolete ones In-Scope Done None
Traffic 200304 Redirect design.wikimedia.org/style-guide/wiki/* to design.wikimedia.org/style-guide/ Screep Done None
Traffic 188776 Move Foundation Wiki to new URL when new Wikimedia Foundation website launches In-Scope Done None
Traffic 201175 cp1085 bad DAC/SFP? Elaborated Done None
Traffic 157786 Unhandled error stopping pybal: 'RunCommandMonitoringProtocol' object has no attribute 'checkCall' In-Scope Done None
Traffic 189305 cp3034: Uncorrectable Memory Error In-Scope Done None
Traffic 200207 Discard of cold, labeled VCL crashes varnish parent and child Screep Done None
Traffic 164609 Merge cache_misc into cache_text functionally In-Scope Done None
Traffic 205077 redirect wikipedia.gr to el.wikipedia.org Screep Done None
Traffic 189250 WP Zero workarounds for eqsin In-Scope Done None
Traffic 203194 cp1080 - kernel / bnxt_en failures Screep Done None
Traffic 179953 cp3043 disk failure In-Scope Done None
Traffic 182993 TLS security review of the Kafka stack In-Scope Done 13.0
Traffic 199717 Pick up a suitable ACME library for certcentral Screep Done None
Traffic 201630 False alarms on varnish-http-requests 70% GET drop in 30 min alert Screep Done None
Traffic 201522 Decommission chromium and hydrogen Screep Done None
Traffic 200445 Upgrade cache servers to stretch Screep Done None
Traffic 203179 Sort out HTTP caching issues for fixcopyright wiki Screep Done None
Traffic 200405 Provide a CI container with pebble Screep Done None
Traffic 194965 gdnsd plugin support for ACME DNS challenges In-Scope Done None
Traffic 191940 Investigate 2018-04-10 global traffic drop In-Scope Done None
Traffic 195923 rack/setup/install cp1075-cp1090 In-Scope Done None
Traffic 133410 Deploy TemplateStyles to WMF production In-Scope Done None
Traffic 203678 certcentral: Make configurable the cmd executed to perform a DNS zone update Screep Done None
Traffic 198922 Setup wikimediafoundation.org domain for July 30 launch of new site Screep Done None
Traffic 203422 certcentral: phantom test failure around challenge success Screep Done None
Traffic 201769 Significant increase in Time To First Byte on 2018-08-08, between 16:00 and 20:00 UTC Screep Done None
Traffic 176366 Decom cp4005-8,13-16 (8 nodes) In-Scope Done None
Traffic 181569 rack/setup scs-eqsin.mgmt.eqsin.wmnet In-Scope Done None
Traffic 168539 Unhandled pybal error: OpenSSL.SSL.Error - ssl handshake failure In-Scope Done None
Traffic 196693 rack/setup/install authdns1001.wikimedia.org In-Scope Done None
Traffic 133717 Letsencrypt all the prod things we can - planning In-Scope Done None
Traffic 192555 Begin execution of non-forward-secret ciphers deprecation In-Scope Done None
Traffic 147202 Removing support for AES128-SHA TLS cipher In-Scope Done None
Traffic 199720 Deploy initial ATS test clusters in core DCs Screep Done None
Traffic 200346 wmf.14 failing to execute ThumbnailRender jobs "error: ThumbnailRenderJob::run: HTTP request failure" Screep Done None
Traffic 196371 Provide a multi-language user-faced warning regarding AES128-SHA deprecation In-Scope Done None
Traffic 196974 cp3037 is currently unreachable In-Scope Done None
Traffic 187157 cp5006 unresponsive In-Scope Done None
Traffic 193865 Enable numa_networking on all caches In-Scope Done None
Traffic 204600 Pass on name of the node serving ORES requests as response header to the user Screep Done None
Traffic 147209 etcd cluster has Raft Internal errors sporadically In-Scope Open None
Traffic 190244 en-wp.org certificate error In-Scope Open None
Traffic 83467 LVS testing needs to include internal services testing In-Scope Open None
Traffic 191393 Puppet: tlsproxy localssl default_server make a Notify at each run In-Scope Open None
Traffic 170605 Unable to render file from upload.wikimedia.org "Error 349 ERR_RESPONSE_HEADERS_MULTIPLE_CONTENT_DISPOSITION" In-Scope Open None
Traffic 180434 Uncacheable content handling: hfp vs hfm In-Scope Open None
Traffic 101002 Use Upgrade Insecure Requests on Wikimedia wikis In-Scope Open None
Traffic 91372 $wgMFAnonymousEditing = true is sometimes not respected: cache? In-Scope Open None
Traffic 179025 LVS hosts should have static-mapped IPv6 on all virtual interfaces In-Scope Open None
Traffic 36670 Check all wikis for inclusions of http resources on https In-Scope Open None
Traffic 176875 Allow access to wdqs.svc.eqiad.wmnet on port 8888 In-Scope Open None
Traffic 98165 Figure out an etcd deploy strategy that includes multi DC failure scenarios. In-Scope Open None
Traffic 99216 Please set up a CNAME for videoserver.wikimedia.org to Video Editing Server In-Scope Open None
Traffic 204992 Puppetise OCSP stapling for all one-off HTTPS servers Screep Open None
Traffic 161148 AuthDNS CM/CI refactor In-Scope Open None
Traffic 202479 Investigate source of 404 Not Found responses from load.php Screep Open None
Traffic 172103 IPVS issues with UDP services, pybal depooling strategy In-Scope Open None
Traffic 130904 Host rewrite for /static/ not applied to purges In-Scope Open None
Traffic 178592 decommission/replace bast4001.wikimedia.org In-Scope Open None
Traffic 169765 pybal should automatically reconnect to etcd In-Scope Open None
Traffic 56783 Respect X-Forwarded-For only from trustworthy sources In-Scope Open None
Traffic 128559 store.wikimedia.org HTTPS issues In-Scope Open None
Traffic 184534 Cached page previews not shown when refreshed In-Scope Open None
Traffic 138093 Investigate query parameter normalization for MW/services In-Scope Open None
Traffic 198620 Consider using vmod_var instead of temporary headers in VCL Screep Open None
Traffic 118181 Planning for phasing out non-Forward-Secret TLS ciphers In-Scope Open None
Traffic 179050 setup bast4002/WMF7218 In-Scope Open None
Traffic 204993 Update certspotter Screep Open None
Traffic 140365 Lower geodns TTLs from 600 (10min) to 300 (5min) In-Scope Open None
Traffic 156462 Framework to transfer files over the LAN In-Scope Open None
Traffic 180921 Referrer policy for browsers which only support the old spec In-Scope Open None
Traffic 146332 Create short link for outreachdashboard.wmflabs.org In-Scope Open None
Traffic 147967 The WMF-Last-Access Set-Cookie header should follow RFC 2965 syntax rather than the pre-RFC Netscape format In-Scope Open None
Traffic 89838 Move proxy IP lists to META for Varnish XFF decoding In-Scope Open None
Traffic 180269 Wikimedia's recent upgrade to nginx v. 1.13.6 breaks older Android HTTP libraries In-Scope Open None
Traffic 128188 Make CI run Varnish VCL tests In-Scope Open None
Traffic 127573 wikiknihy.cz - transfer to Wikimedia Czech Republic? In-Scope Open None
Traffic 167513 Redirect lzh.wikipedia to zh-classical.wikipedia In-Scope Open None
Traffic 202381 Traffic Server - Prometheus integration Screep Open None
Traffic 102178 Fix RESTBase support for wikitech.wikimedia.org In-Scope Open None
Traffic 141480 mixed-content issues on planet.wikimedia.org In-Scope Open None
Traffic 180655 Phabricator and Gerrit: Improve the way that maintenance downtime is communicated to users. In-Scope Open None
Traffic 204994 Integrate certspotter with certcentral to avoid certspotter notifying us on legitimate certs generated by our certcentral boxes Screep Open None
Traffic 117826 TEST: redirect small portion of unauthenticated desktop users to mobile web In-Scope Open None
Traffic 162818 icinga alerts on nodejs services when a recdns server is depooled In-Scope Open None
Traffic 176388 pybal: race condition in alerts instrumentation In-Scope Open None
Traffic 146832 Clarify caching to enable direct Wikidata Query Service access by <mapframe/link> In-Scope Open None
Traffic 91820 Create HTTP verb and sticky cookie DC routing in VCL In-Scope Open None
Traffic 204208 puppetize http purging for ATS backends Screep Open None
Traffic 114104 pybal doesn't fully manage LVS table leaving stale services (on IP change) In-Scope Open None
Traffic 163141 dbtree: make wasat a working backend and become active-active In-Scope Open None
Traffic 132629 Data passed to HHVM ($_SERVER variables) is a mixed bag of already-decoded and non-decoded nonsense In-Scope Open None
Traffic 147162 upload.wikimedia.org returns HTTP 501 instead of 416 for non-satisfiable byte ranges In-Scope Open None
Traffic 180257 Puppet / LVS: confusion in service vs IP name In-Scope Open None
Traffic 194814 Reduce amount of headers sent from web responses In-Scope Open None
Traffic 181315 Varnish HTTP response from app servers taking 160s (only 0.031s inside Apache) In-Scope Open None
Traffic 178815 decom cp40(09|1[078]) In-Scope Open None
Traffic 186732 Decide on Cache-Control headers for map tiles In-Scope Open None
Traffic 171470 Monitor DNS delegations In-Scope Open None
Traffic 78963 Support ESI for ResourceLoader In-Scope Open None
Traffic 108580 HTTPS for internal service traffic In-Scope Open None
Traffic 190992 prometheus: slow dashboards due to suboptimal query_range performance In-Scope Open None
Traffic 194031 Setup a new PKI software as an alternative to the puppet CA for managing services certificates In-Scope Open None
Traffic 204232 Package and deploy ATS v8.x Screep Open None
Traffic 129839 restrict upload cache access for private wikis In-Scope Open None
Traffic 185239 Puppet hosts with signed certificate present on agent but not master In-Scope Open None
Traffic 109325 Outbound HTTPS for varnish backend instances In-Scope Open None
Traffic 164259 Add VSL error counters to Varnishkafka stats In-Scope Open None
Traffic 122867 Evaluate the feasibility of cache invalidation for the action API In-Scope Open None
Traffic 97051 adding new languages to DNS langs.tmpl doesn't work until zone template is edited as well In-Scope Open None
Traffic 205378 Enable ESNI support on Wikimedia servers Screep Open None
Traffic 167972 Respect host header in RESTBase, and redirect /rest_v1 to /rest_v1/ In-Scope Open None
Traffic 202564 https://sv.wikipedia.beta.wmflabs.org/ has invalid certificate Screep Open None
Traffic 147648 Unexplained increase in thumbnail 500s In-Scope Open None
Traffic 101525 Set up LVS for current AuthDNS In-Scope Open None
Traffic 137979 Support brotli compression In-Scope Open None
Traffic 125226 [feature request] Redirect root API path to docs page In-Scope Open None
Traffic 172124 PyBal Feature: progressive depooling strategy for monitored failures In-Scope Open None
Traffic 202046 cp3032 PS Redundancy Lost Screep Open None
Traffic 79730 Add pybal check to ensure service IP is bound In-Scope Open None
Traffic 150673 Thumb API: Varnish / CDN questions In-Scope Open None
Traffic 119366 Disable caching on the main page for anonymous users In-Scope Open None
Traffic 99531 [Task] move wikiba.se webhosting to wikimedia misc-cluster In-Scope Open None
Traffic 136703 Add LVS public endpoint checks that bypass caches In-Scope Open None
Traffic 143562 High number of failed inbound TFO connections in esams Mon-Fri In-Scope Open None
Traffic 120121 Improve Varnish XFF processing for trusted proxies In-Scope Open None
Traffic 192206 Remove wildcard vhost for *.wikimedia.org In-Scope Open None
Traffic 188804 Investigate and fix odd uri_host values In-Scope Open None
Traffic 183554 Unified certs bloat reduction? In-Scope Open None
Traffic 174932 Recurrent 'mailbox lag' critical alerts and 500s In-Scope Open None
Traffic 88861 wikipedia.lol In-Scope Open None
Traffic 179197 Investigate what caused the unattended varnish upgrade in Beta Cluster In-Scope Open None
Traffic 178535 decommission lvs400[1-4].ulsfo.wmnet In-Scope Open None
Traffic 153468 Ferm's upstream Net::DNS Perl library bad handling of NOERROR responses without records causing puppet errors when we try to @resolve AAAA in labs In-Scope Open None
Traffic 167400 Disable serving unpatrolled new files to Wikipedia Zero users In-Scope Open None
Traffic 137747 Parametrization of VCL is inconsistent In-Scope Open None
Traffic 192368 Unconditional return(deliver) in vcl_hit In-Scope Open None
Traffic 203191 prometheus-varnish-exporter@frontend.service: Unit entered failed state - invalid character 'C' Screep Open None
Traffic 189290 Tune systemd journal rate limiting for PyBal In-Scope Open None
Traffic 66214 Define an official thumb API In-Scope Open None
Traffic 193521 Consider adding expect-CT: header to enforce certificate transparency In-Scope Open None
Traffic 161517 Allow anonymous users to change interface language on Commons with ULS In-Scope Open None
Traffic 188561 SSL cert for links.email.wikimedia.org In-Scope Open None
Traffic 194962 Create and deploy a centralized letsencrypt service In-Scope Open None
Traffic 109331 Deleted files sometimes remain visible to non-privileged users if permanently linked In-Scope Open None
Traffic 152622 Wikipedia.cz and other domains owned by WMCZ have invalid certificate In-Scope Open None
Traffic 111588 RFC: API-driven web front-end In-Scope Open None
Traffic 144508 Point wikipedia.in to 205.147.101.160 instead of URL forward In-Scope Open None
Traffic 198286 Decommission acamar and achernar In-Scope Open None
Traffic 194724 Deprecate `base::service_unit` in puppet In-Scope Open None
Traffic 202966 Make cp1099 the new pinkunicorn Screep Open None
Traffic 161360 404 loading images from Virgin Media In-Scope Open None
Traffic 201409 Harmonise the identification of requests across our stack Screep Open None
Traffic 196066 Add prometheus metrics for varnishkafka instances running on caching hosts In-Scope Open None
Traffic 113817 Connect Hadoop records of the same request coming via different channels In-Scope Open None
Traffic 152091 Block hotlinking In-Scope Open None
Traffic 78421 m.{project}.org portal/redirect consistency In-Scope Open None
Traffic 190607 cp3048 hardware issues In-Scope Open None
Traffic 192082 lvs2006 Embedded Flash/SD-CARD iLO errors In-Scope Open None
Traffic 204209 Define and deploy Icinga checks for ATS backends Screep Open None
Traffic 128374 Sort out analytics service dependency issues for cp* cache hosts In-Scope Open None
Traffic 190843 Spammy events coming our way for sites such as https://ru.wikipedia.kim In-Scope Open None
Traffic 141266 letsencrypt puppetization: add parallel rsa+ecdsa cert support In-Scope Open None
Traffic 136944 Set up LVS connection sync In-Scope Open None
Traffic 193445 Update Media dashboard in Grafana to use Prometheus metrics In-Scope Open None
Traffic 134807 Replace test hostnames in datacenter-specific subdomains with dashed names In-Scope Open None
Traffic 105657 Expires header for load.php should be relative to request time instead of cache time In-Scope Open None
Traffic 152882 Many misc wikis lack mobile domains In-Scope Open None
Traffic 199711 Deploy a scalable service for ACME (LetsEncrypt) certificate management Screep Open None
Traffic 154801 Investigate varnishd child crashes when multiple nodes get depooled/pooled concurrently In-Scope Open None
Traffic 75944 Monitor Varnish caches on beta cluster have two varnishd process running In-Scope Open None
Traffic 146619 DNS domains registered to WMF no longer redirecting In-Scope Open None
Traffic 191183 Enable avatars in gerrit In-Scope Open None
Traffic 170567 Support TLSv1.3 In-Scope Open None
Traffic 192559 Establish timeline and methodology for upcoming deprecation of non-forward-secret ciphers and TLSv1.0 In-Scope Open None
Traffic 117618 Add restrictive CSP to upload.wikimedia.org In-Scope Open None
Traffic 175636 prometheus -> grafana stats for per-numa-node meminfo In-Scope Open None
Traffic 126281 [Regression] stats.wikipedia.org redirect no longer works ("Domain not served here") In-Scope Open None
Traffic 158599 Samsung Internet's desktop mode getting redirected to mobile site In-Scope Open None
Traffic 164327 replace ulsfo aging servers In-Scope Open None
Traffic 102848 Split GeoIP into a new component In-Scope Open None
Traffic 204355 Allow traffic team to manage the traffic blog on phame Screep Open None
Traffic 199675 cp5001 unreachable since 2018-07-14 17:49:21 Screep Open None
Traffic 192280 sda failure in hydrogen.wikimedia.org In-Scope Open None
Traffic 204013 Horizon Designate dashboard not allowing creation of NS records Screep Open None
Traffic 150479 Prometheus varnish metric churn due to VCL reloads In-Scope Open None
Traffic 199247 Decommission baham Screep Open None
Traffic 179027 Puppetize LVS interface IP sets per-DC for easy use in ferm rules In-Scope Open None
Traffic 106517 upload.wikimedia.org returns HTTP status code 503 for truncated urls, not 404 In-Scope Open None
Traffic 81305 Make PyBal respect advertised BGP capabilities In-Scope Open None
Traffic 164768 Explicitly limit varnishd transient storage In-Scope Open None
Traffic 200806 cp3031: Power required by the system exceeds the power supplied by the Power Supply Units Screep Open None
Traffic 86915 nan and minnan subdomain redirects are a mess In-Scope Open None
Traffic 54253 Protocol-relative URLs are poorly supported or unsupported by a number of HTTP clients In-Scope Open None
Traffic 190993 Upgrade pybal-test instances to stretch In-Scope Open None
Traffic 178173 Renew unified certificates 2017 In-Scope Open None
Traffic 144187 Better handling for one-hit-wonder objects In-Scope Open None
Traffic 174960 Varnish does not vary elasticsearch query by request body In-Scope Open None
Traffic 203396 certcentral: challenge checking on *all* pooled backend hosts Screep Open None
Traffic 104442 Investigate better DNS cache/lookup solutions In-Scope Open None
Traffic 200673 varnish-http-requests false positives when a DC is depooled Screep Open None
Traffic 107236 Switch port 80 to nginx on primary clusters In-Scope Open None
Traffic 159411 Uniform cluster nomenclature across puppet In-Scope Open None
Traffic 127482 Enable VCL source-DC switching via confd In-Scope Open None
Traffic 104681 HTTPS Plans (tracking / high-level info) In-Scope Open None
Traffic 159412 Convert all of our site.pp/roles to the role/profile paradigm In-Scope Open None
Traffic 127387 Split slash decoding from general percent normalization in Varnish VCL In-Scope Open None
Traffic 204365 Stop oversampling Asian countries Screep Open None
Traffic 131930 Set SPF (... -all) for toolserver.org In-Scope Open None
Traffic 198152 Size of headers processed by varnish? In-Scope Open None
Traffic 50133 ForeignAPIRepo wrongly returns non-protocol-relative URLs for original "thumbs" In-Scope Open None
Traffic 174432 Unclear LVS bandwidth graph in "load balancers" dashboard In-Scope Open None
Traffic 203423 certcentral: Provide script for certificate revocation Screep Open None
Traffic 164456 Migrate to nginx-light In-Scope Open None
Traffic 173966 Like nan.wikipedia.org, redirect other nan.*.org to the proper zh-min-nan.*.org domains In-Scope Open None
Traffic 150022 thumb.php should not set CC:no-cache on renderer 404 responses? In-Scope Open None
Traffic 138591 Backport iproute2 4.x from debian testing -> our jessie In-Scope Open None
Traffic 175319 cp1066 unexplained 503 spikes In-Scope Open None
Traffic 134447 letsencrypt puppetization: upgrade for scalability In-Scope Open None
Traffic 102367 Migrate tools.wmflabs.org to https only (and set HSTS) In-Scope Open None
Traffic 167906 Make API usage limits easier to understand, implement, and more adaptive to varying request costs / concurrency limiting In-Scope Open None
Traffic 138546 Backend naming in VCL needs to use fqdn+port In-Scope Open None
Traffic 204931 Re-evaluate use of EV certificates for payments.wm.o? Screep Open None
Traffic 163541 cache hosts should auto-repool iff OCSP files are sane In-Scope Open None
Traffic 120631 Security: Is it safe to enable Zero spoofing In-Scope Open None
Traffic 174342 Missing IP addresses for Maroc Telecom In-Scope Open None
Traffic 196560 rack/setup/install LVS200[7-10] In-Scope Open None
Traffic 149847 RFC: Use content hash based image / thumb URLs In-Scope Open None
Traffic 133178 RESTBase support for www.wikimedia.org missing In-Scope Open None
Traffic 175203 Implement stateless TCP balancing in our LVS servers In-Scope Open None
Traffic 177742 Investigate Chrony as a replacement for ISC ntpd In-Scope Open None
Traffic 125170 Internal DNS resolver responds with NXDOMAIN for localhost AAAA In-Scope Open None
Traffic 202040 Decommission radon Screep Open None
Traffic 192437 Pybal support of configuration from the kubernetes API In-Scope Open None
Traffic 188087 Some etcd connections not established at startup In-Scope Open None
Traffic 128409 Detect tools.wmflabs.org tools which are HTTP-only In-Scope Open None
Traffic 204997 certcentral: delay deployment of renewed certs to wait out skewed client clocks Screep Open None
Traffic 109776 Tilerator should purge Varnish cache In-Scope Open None
Traffic 161256 multi-component wmflabs.org subdomains doesn't work under simple wildcard TLS cert In-Scope Open None
Traffic 171850 Backport ipvsadm In-Scope Open None
Traffic 120486 add a https-only option to dynamicproxy In-Scope Open None
Traffic 129682 Look into solutions for replaying traffic to testing environment(s) In-Scope Open None
Traffic 131894 Collect Backend-Timing in Prometheus In-Scope Open None
Traffic 181368 Log source port for anonymous users and expose it for sysops/checkusers In-Scope Open None
Traffic 149873 CentralNotice: Review and update Varnish caching for Special:BannerLoader In-Scope Open 2.0
Traffic 205439 CI jobs for authdns linting need to run on Stretch Screep Open None
Traffic 184715 pybal's "can-depool" logic only takes downServers into account In-Scope Open None
Traffic 133821 Content purges are unreliable In-Scope Open None
Traffic 128358 Uploading 1.2GB ogv results in 503 In-Scope Open None
Traffic 177927 Refactor kafka_config.rb and kafka_cluster_name.rb in puppet to avoid explicit hiera calls In-Scope Open None
Traffic 196248 TLS certificates renewal process In-Scope Open None
Traffic 204987 Consider adding Must-Staple header to enforce revocation checking Screep Open None
Traffic 177961 Upgrade LVS servers to stretch In-Scope Open None
Traffic 133895 Varnish configuration for mobile domains should be coherent with Apache configuration In-Scope Open None
Traffic 96499 dbtree loads third party resources (from jquery.com and google.com) In-Scope Open None
Traffic 148976 Strongswan Icinga check: do not report issues about depooled hosts In-Scope Open None
Traffic 202627 cp3036 PS Redundancy Lost Screep Open None
Traffic 199252 Search engines continue to link to JS-redirect destination after Wikipedia copyright protest Screep Open None
Traffic 201666 cp3040: kernel crash in ipsec code shortly after reboot Screep Open None
Traffic 164460 Use DNS discovery record for deployment CNAME In-Scope Open None
Traffic 171498 Implement machine-local forwarding DNS caches In-Scope Open None
Traffic 120509 Cache education dashboard pages In-Scope Open None
Traffic 119372 Pybal IdleConnectionMonitor with TCP KeepAlive shows random fails if more than 100 servers are involved. In-Scope Open None
Traffic 117435 Spike: CentralNotice: Verify that our Special:HideBanners cookie storm works as efficiently as possible In-Scope Open 2.0
Traffic 82849 lvs servers report 'Memory allocation problem' on bootup In-Scope Open None
Traffic 112316 Configure varnish to use "Unconfigured domain" page for 404 Not Served (instead of generic error) In-Scope Open None
Traffic 45250 Redo /beacon/impression system (formerly Special:RecordImpression) to remove extra round trips on all FR impressions (title was: S:RI should pyroperish) In-Scope Open None
Traffic 191017 Unwanted service startups and their triggers In-Scope Open None
Traffic 204225 ATS: log inspection at runtime Screep Open None
Traffic 148134 OCSP Stapling for Intermediates In-Scope Open None
Traffic 184942 Deprecate python varnish cachestats In-Scope Open None
Traffic 133548 Create a secure redirect service for large count of non-canonical / junk domains In-Scope Open None
Traffic 94125 Central login notice appears on unencrypted API format=*fm pages, where reloading does not affect login status In-Scope Open None
Traffic 133001 Decom legacy ex-parsoidcache cxserver, citoid, and restbase service hostnames In-Scope Open 0.0
Traffic 165764 Fully-redundant LVS clusters using Pybal per-service MED feature In-Scope Open None
Traffic 134324 confctl select needs a -y flag? In-Scope Open None
Traffic 96844 Update TLS/HTTP documentation on wikitech In-Scope Open None
Traffic 179026 LVS IPv6 IPs should all be recorded in DNS In-Scope Open None
Traffic 118468 point wikilovesmonuments.org ns to wmf In-Scope Open None
Traffic 137252 Redirect phabricator.mediawiki.org to phabricator.wikimedia.org In-Scope Open None
Traffic 205344 Inconsistent lists of labs-ns* nameservers Screep Open None
Traffic 180712 VCL: handling of uncacheable responses in wikimedia-common In-Scope Open None
Traffic 123854 Set up action API latency / error rate metrics & alerts In-Scope Open None
Traffic 184293 rack/setup/install lvs101[3-6] In-Scope Open None
Traffic 158604 Investigate usefulness of SameSite cookies for logged-in accounts In-Scope Open None
Traffic 134323 confctl: give regexen more freedom In-Scope Open None
Traffic 185350 Vet reliability of the response_size field for data analysis purposes In-Scope Open None
Traffic 155314 Varnish does not cache Action API responses when logged in In-Scope Open None
Traffic 165560 Artificial spike in offset of unique devices from November to February 6th on wikidata In-Scope Open None
Traffic 199677 cp3033 unreachable since 2018-07-15 11:47:31 Screep Open None
Traffic 119038 Image cache issue when 'over-writing' an image on commons In-Scope Open None
Traffic 204056 Move wikimedia.ee under WM-EE Screep Open None
Traffic 74186 Varnish: Mobile site redirect interferes with OAuth authorization process In-Scope Open None
Traffic 23027 Requests with utf-8 in the URL return an outdated page revision In-Scope Open None
Traffic 205040 Show SVGs in page language if available Screep Open None
Traffic 165765 Refactor pybal/LVS config for shared failover In-Scope Open None
Traffic 170606 Add Accept header to webrequest logs Screep Open None
DBA 199124 Remove all usages of $::mw_primary on puppet Screep Done None
DBA 202822 db2088 rebooted itself and came back sick Screep Done None
DBA 193732 Decommission db1060 In-Scope Done None
DBA 199056 db1069 bad disk Screep Done None
DBA 205253 db1069 has errored disk in slot 7 Screep Done None
DBA 201493 Disk #9 with errors on db1068 (s4 master) Screep Done None
DBA 194118 Decommission db1055 In-Scope Done None
DBA 204311 Upgrade all core (mediawiki) database servers to mariadb 10.1 Elaborated Done None
DBA 201387 Upgrade pc2004 and pc2005 BIOS Screep Done None
DBA 197072 Physically move es1017 from D to C row In-Scope Done None
DBA 196606 Decommission db1059 Screep Done None
DBA 201603 db2069 storage crash Screep Done None
DBA 201132 es1019 mgmt interface DOWN Screep Done None
DBA 50930 Database replication problems - production and labs (tracking) In-Scope Done None
DBA 204462 Degraded disk on db1069 (x1 master) Screep Done None
DBA 200641 pc2006 rebooted itself Screep Done None
DBA 199636 Degraded RAID on db1072 Screep Done None
DBA 200287 Degraded RAID on db1069 Screep Done None
DBA 195228 db2064 crashed and totally broken - decommission it In-Scope Done None
DBA 199009 sql config differs between mwmaint1001 and deploy1001 Screep Done None
DBA 201245 Degraded RAID on db2054 Screep Done None
DBA 193736 Decommission db1056 In-Scope Done None
DBA 202824 Degraded RAID on db2058 Screep Done None
DBA 135851 Preserve InnoDB table auto_increment on restart In-Scope Done None
DBA 203039 Storage of data for recommendation API Screep Done None
DBA 200059 db2061 disk with predictive failure Screep Done None
DBA 203623 Degraded RAID on db2053 Screep Done None
DBA 199861 Decommission db1052 Screep Done None
DBA 201021 HAproxy on dbproxy hosts lacks enough logging Screep Done None
DBA 194634 Decommission db1053 Screep Done None
DBA 197063 Decommission db1054 In-Scope Done None
DBA 199759 Degraded RAID on db2061 Screep Done None
DBA 201133 db1069 (x1 master) memory errors Elaborated Done None
DBA 196690 rack/setup/install dbproxy101[2-7].eqiad.wmnet In-Scope Done None
DBA 165625 Evaluate future of wmf puppet module "mysql" In-Scope Done None
DBA 195484 Decommission db1051 Screep Done None
DBA 83609 script & docs to rename wiki databases In-Scope Open None
DBA 193224 Evaluate and decide the future of relational datastore at WMF after the upgrade of MariaDB 10.1 is finished In-Scope Open None
DBA 109179 Migrate MySQLs to use ROW-based replication In-Scope Open None
DBA 112473 Better mysql monitoring for number of connections and processlist strange patterns In-Scope Open None
DBA 189107 DB meta task for next DC failover issues In-Scope Open None
DBA 196055 Remove table `math` from the database In-Scope Open None
DBA 196378 Investigate solutions for MySQL connection pooling In-Scope Open None
DBA 160731 Decom db1048 (BBU Faulty - slave lagging) In-Scope Open None
DBA 177779 Generate instance list of database hosts to be monitored automatically from exported resources In-Scope Open None
DBA 143896 MySQL metrics monitoring In-Scope Open None
DBA 196547 [Epic] Extension:JADE scalability concerns In-Scope Open None
DBA 107610 Setup separate logical External Store for Flow in production In-Scope Open None
DBA 152427 Create a check/calendar alert for MariaDB TLS certs In-Scope Open None
DBA 133523 [RFC] improve parsercache replication and sharding handling In-Scope Open None
DBA 165677 Create a backend check for pybal to monitor the MySQL protocol being up In-Scope Open None
DBA 202596 Write our anticipated "phase two" schemas and submit for review Elaborated Open None
DBA 112282 Multiple pages with no revisions In-Scope Open None
DBA 100501 mysql user and group should be a system user/group In-Scope Open None
DBA 200297 Introduce a new namespace for collaborative judgments about wiki entities Elaborated Open None
DBA 202051 db2042 (m3) master RAID battery failed Screep Open None
DBA 204026 DBPerformance warning "Query returned 22186 rows: query: SELECT * FROM `translate_metadata`" on Meta-Wiki Screep Open None
DBA 175672 Make apache/maintenance hosts TLS connections to mariadb work In-Scope Open None
DBA 157702 Followup for TLS MariaDB server roll-out In-Scope Open None
DBA 197531 Data model for dbconfig In-Scope Open None
DBA 141547 Setup automatic failover for misc database servers In-Scope Open None
DBA 205780 db1067 (enwiki master) disk #7 with errors Screep Open None
DBA 119626 Eliminate SPOF at the main database infrastructure In-Scope Open None
DBA 126252 Populate the wikishared db on all dbstores In-Scope Open None
DBA 119154 Move echo tables from local wiki databases onto extension1 cluster for mediawikiwiki, metawiki, and officewiki In-Scope Open None
DBA 197126 Create tool to handle the state of database configuration in MediaWiki in etcd In-Scope Open None
DBA 164834 In some database hosts, performance schema loses digest statistics In-Scope Open None
DBA 185084 Allow use of EtcdConfig to configure slave databases In-Scope Open None
DBA 161754 eqiad: (2) hardware access request for labsdb1004 & 5 refresh In-Scope Open None
DBA 184805 Move some wikis to s5 In-Scope Open None
DBA 166108 x1 master db1031: Faulty BBU In-Scope Open None
DBA 104699 Firewall configurations for database hosts In-Scope Open None
DBA 204928 Issues with mgmt interface on es2001 host Screep Open None
DBA 134809 Apache <=> mariadb SSL/TLS for cross-datacenter writes In-Scope Open None
DBA 141968 Display lag on grafana (prometheus) and dbtree from pt-heartbeat instead (or in addition) of Seconds_Behind_Master In-Scope Open None
DBA 162070 Cleanup or remove mysql puppet module; repurpose mariadb module to cover misc use cases In-Scope Open None
DBA 161755 eqiad: (2) hardware access request for labsdb1006 & 7 refresh In-Scope Open None
DBA 176532 Gerrit is failing to connect to db on gerrit2001 thus preventing systemd from working In-Scope Open None
DBA 205514 db1092 crashed - BBU broken Screep Open None
DBA 145072 Create a script to regenerate prometheus mysqld exporter listing that works with puppetdb In-Scope Open None
DBA 127570 Rename be_x_oldwiki database to be_taraskwiki In-Scope Open None
Software Development 203948 Convert deploy_apache_change.sh to a spicerack cookbook Screep Open None
Software Development 159045 Update Puppet repo code that uses maniphest.update and maniphest.createtask conduit api In-Scope Open None
Software Development 201669 wmf-auto-reimage should retry on ipmi failures Screep Open None
Software Development 182028 DNS repo: add CI checks for obvious configuration errors In-Scope Open None
Software Development 167504 New tool to track package updates/status for hosts and images (debmonitor) In-Scope Open None
Software Development 157133 Consider adding a --skip-conftool option to puppet-merge In-Scope Open None
Software Development 203964 Create a spicerack cookbook to empty a ganeti node from VMs Screep Open None
Software Development 203944 Create a spicerack cookbook for restoring an etcd cluster from backups Screep Open None
Software Development 177385 Upgrade Cumin masters to stretch In-Scope Open None
Software Development 199911 Systemd session creation fails under I/O load Screep Open None
Software Development 201346 rack/setup/install clustermgmt1001.eqiad.wmnet (new cumin master) Screep Open None
Software Development 203963 Convert makevm to spicerack cookbook Screep Open None
Software Development 201317 wmf-auto-reimage: 'execution expired' on first puppet run Screep Open None
Software Development 152950 E901 SyntaxError: invalid syntax is wrongly raised on using python's abc by jenkins python CI linter In-Scope Open None
Software Development 184435 Puppet tox: properly lint both Py2 and Py3 files In-Scope Open None
Software Development 144169 Flake8 for python files without extension in puppet repo In-Scope Open None
Software Development 164587 cumin could use randomization/splay options In-Scope Open None
Software Development 157002 Puppet compiler: re-add the concurrency option NUM_THREADS In-Scope Open None
Software Development 204789 wmf-auto-reimage tries to remove from Debmonitor even with --new Screep Open None
Software Development 148494 Add shell scripts CI validations In-Scope Open None
Software Development 198850 debmonitor: Race condition between package updated triggered by apt hook and daily cron run Screep Open None
Software Development 150560 More verbose messages from service-checker-swagger In-Scope Open None
Software Development 155705 confctl: log to SAL even if the selection doesn't match any host In-Scope Open None
Software Development 157001 Puppet compiler: abort on git rebase conflict In-Scope Open None
Software Development 203943 Convert automation scripts to spicerack cookbooks Screep Open None
Software Development 154776 Puppet compiler: order resources for easy comparison between hosts In-Scope Open None
Hardware Requests 198169 None In-Scope Cut None
Hardware Requests 196345 None In-Scope Cut None
Hardware Requests 203087 Decommission Ganeti vm meitnerium.wikimedia.org (old Archiva host) Screep Done None
Hardware Requests 201938 Request for swift ms-be refresh Screep Done None
Hardware Requests 181264 Refresh or replace oxygen In-Scope Open None
Hardware Requests 199673 eqiad | (14 + 6) hadoop hardware refresh and expansion Screep Open None
Hardware Requests 204589 eqiad: (1) misc single cpu server allocation for performance browser testing Screep Open None
Hardware Requests 205092 Refresh (leased) restbase2001-2006 Screep Open None
Hardware Requests 199674 eqiad | (3) Labs Data Lake hardware Screep Open None
Hardware Requests 139775 eqiad: add all spare network switches to hardware spares tracking In-Scope Open None
Other Operations 169570 nfs-manage failover script needs to be tested with real load and fixed In-Scope Cut None
Other Operations 192948 Upgrade prometheus-jmx-exporter on all services using it In-Scope Cut None
Other Operations 199853 Increase webperf1002/webperf2002 space from 50GB to 150GB (Ganeti) Elaborated Done None
Other Operations 191438 Upgrade Puppet compilers to Stretch In-Scope Done None
Other Operations 201863 RESTBase dev environment (Cassandra) SSL certificates expired Screep Done None
Other Operations 164341 Decommission old memcached hosts - mc1001->mc1018 In-Scope Done None
Other Operations 201804 restbase2003 has a broken disk (at least) Screep Done None
Other Operations 205284 Degraded RAID on rdb1004 Screep Done None
Other Operations 198662 Request access to data for citation usage research Screep Done None
Other Operations 200924 Mailing list for Knowledge Integrity program Screep Done None
Other Operations 194176 wtp2020 correctable memory errors In-Scope Done None
Other Operations 193649 migrate elasticsearch to stretch (from jessie) In-Scope Done None
Other Operations 163402 Ensure we can survive a loss of labservices1001 In-Scope Done None
Other Operations 202069 Requesting access to view EventLogging data for Tonina WMDE Screep Done None
Other Operations 202963 eqiad (1) - VM request for Piwik/Matomo Screep Done None
Other Operations 197169 10G ports seem not to work on new HP hardware In-Scope Done None
Other Operations 201824 Spin up a new poolcounter node for ores Screep Done None
Other Operations 200177 Set a proper max open files limit for Kafka clusters Screep Done None
Other Operations 199965 Give Bmueller "grafana-admin" LDAP group access Screep Done None
Other Operations 168407 rack/setup/install labnodepool1002.eqiad.wmnet In-Scope Done None
Other Operations 198371 All "zh-my" variant page views get 404 Not Found on zh.wikipedia.org Screep Done None
Other Operations 193025 Decommission poolcounter1002 In-Scope Done None
Other Operations 205110 install additional SSDs maps100[1-4] Screep Done None
Other Operations 201467 Growth Team Mailing List Screep Done None
Other Operations 173097 Decommission stat1002.eqiad.wmnet In-Scope Done None
Other Operations 199131 Please install Text::CSV_XS at stat1005 Screep Done None
Other Operations 202476 Give thiemowmde permission to upload wikidiff2 releases (releasers-wikidiff2) Screep Done None
Other Operations 194172 mw2213 correctable memory errors In-Scope Done None
Other Operations 199813 EventStreams accumulates too much memory on SCB nodes in CODFW Screep Done None
Other Operations 202737 Creation of a mailing list for the "Wiki Labs Culture" initiative Screep Done None
Other Operations 157030 cannot delete non-empty directory: php-1.29.0-wmf.3 messages on 'scap sync' on mwdebug1002 In-Scope Done None
Other Operations 175625 scs-c1-eqiad unresponsive In-Scope Done None
Other Operations 192551 atop on stretch overloading a host In-Scope Done None
Other Operations 202011 Move internal sites hosted on thorium to ganeti instance(s) Screep Done 13.0
Other Operations 202363 Requesting access to restricted production access and analytics-privatedata-users for Ty Hargrove Screep Done None
Other Operations 199441 Document the process for hard-deleting topics in kafka Screep Done None
Other Operations 204590 Add sbassett to security@ Screep Done None
Other Operations 185004 Decommission mw1201-mw1220 In-Scope Done None
Other Operations 169035 bast3002 sdb broken In-Scope Done None
Other Operations 203851 Trying to install updated versions of "linux-meta linux-meta-4.9" fails Screep Done None
Other Operations 204667 Ferm leftovers on labtestnet2003 Screep Done None
Other Operations 189566 Decommission eventlog1001 In-Scope Done None
Other Operations 202092 convert cloud VPS projects from apache to httpd module (wikidata-query/ldfclient) Screep Done None
Other Operations 201439 rename/reimage labnodepool1002.eqiad.wmnet as cloudservices1003.wikimedia.org Elaborated Done None
Other Operations 189890 add ssh key comparison to cross-validate-accounts.py In-Scope Done None
Other Operations 195423 Reduce false positive icinga alerts during host reimages In-Scope Done None
Other Operations 204790 nathante/groceryheist shell request for researchers, statistics-privatedata-users, analytics-privatedata-users Screep Done None
Other Operations 189435 Integrate stretch 9.4 point update In-Scope Done None
Other Operations 201671 Transient failures of IPMI commands to elastic2017 Screep Done None
Other Operations 193394 Degraded RAID on wasat In-Scope Done None
Other Operations 201185 Jmorgan production ssh revocation/replacement (due to key in use in production and cloud) Screep Done None
Other Operations 193121 Upgrade ganeti hosts to stretch In-Scope Done None
Other Operations 201344 rack/setup/install icinga1001.wikimedia.org Screep Done None
Other Operations 199436 Alert on negative disk space available Screep Done None
Other Operations 203576 Interface errors for stat1006 Screep Done None
Other Operations 195293 503 error attempting to open multiple projects (Wikipedia and meta wiki are loading very slowly) In-Scope Done None
Other Operations 185504 Netbox: add Icinga check for PostgreSQL In-Scope Done None
Other Operations 181205 Let quarry use the mariadb module In-Scope Done None
Other Operations 201920 wikimedia-us-mn administration password reset Screep Done None
Other Operations 202546 Requesting access to restricted production access for Bill Pirkle Screep Done None
Other Operations 136403 Move cp3030+ from OE14 to OE13 in racktables In-Scope Done None
Other Operations 198092 Broken apt config on kafka/analytics hosts In-Scope Done 3.0
Other Operations 152439 cronspam from labtestservices2001 /etc/dns-floating-ip-updater.py > /dev/null In-Scope Done None
Other Operations 202700 unrack/decom cr1-eqdfw Screep Done None
Other Operations 198398 mw1239 correctable memory errors In-Scope Done None
Other Operations 181634 Investigate overload condition, seems that we lose nodes In-Scope Done None
Other Operations 202498 Redirect 2030.wikimedia.org to the new movement strategy portal Screep Done None
Other Operations 152767 Missing Labs hiera entry in labs-private repo In-Scope Done None
Other Operations 196701 rack/setup/install torrelay1001.wikimedia.org In-Scope Done None
Other Operations 192561 Upgrade deployment-prep deployment servers to stretch In-Scope Done None
Other Operations 181988 Investigate and improve memory allocation rates of WDQS In-Scope Done None
Other Operations 203847 Requesting access to researchers for kharlan Screep Done None
Other Operations 201668 Requesting access to restricted production access and analytics-privatedata-users for Karen Brown Screep Done None
Other Operations 200951 Update redirect for jobs.wikimedia.org Screep Done None
Other Operations 198547 Spam on phabricator Screep Done None
Other Operations 164290 Set up external DNS record for wikitech-static In-Scope Done None
Other Operations 200800 NDA access for Telecom Paristech Research Team Screep Done None
Other Operations 193915 rename wasat to mwmaint2001 and reinstall it with stretch In-Scope Done None
Other Operations 176957 Decommission host copper.eqiad.wmnet In-Scope Done None
Other Operations 202006 mw2184 stuck after reboot Screep Done None
Other Operations 191360 decom spare server lawrencium/WMF3542 In-Scope Done None
Other Operations 200723 Remove expiry date from Morten Warncke-Wang's production shell access Screep Done None
Other Operations 204999 update label on an-master100[12].eqiad.wmnet Screep Done None
Other Operations 200317 grafana fails to load dashboards from disk Screep Done None
Other Operations 183891 Archive mediawiki/extensions/Collection/OfflineContentGenerator and all OCG-related repos Elaborated Done None
Other Operations 202072 Requesting Access to view EventLogging data for gabriel-wmde / gbirke Screep Done None
Other Operations 196666 rack/setup/add to spares tracking 2 single cpu misc class systems In-Scope Done None
Other Operations 169290 New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS In-Scope Done None
Other Operations 200215 Create kafka topic for mjolnir bulk daemon and decide on cluster Screep Done None
Other Operations 94819 Audit racktables In-Scope Done None
Other Operations 201670 Wikimedia Community User Group Albania mailing list request Screep Done None
Other Operations 198391 migrate elasticsearch cirrus cluster to RAID0 In-Scope Done None
Other Operations 201952 operations-puppet:0.3.4 doesn't seem to be properly published Screep Done None
Other Operations 201772 maps.wikimedia.org is showing old vandalized version of OSM Screep Done None
Other Operations 198792 snapshot1005 does not power back up Screep Done None
Other Operations 202475 Give WMDE-Fisch permission to upload wikidiff2 releases (releasers-wikidiff2) Screep Done None
Other Operations 192996 Delete deployment-mediawiki06 Screep Done None
Other Operations 196651 rack upgraded storage capacity in labstore100[67].eqiad.wmnet In-Scope Done None
Other Operations 201849 Request production global root access for Effie Mouzeli Screep Done None
Other Operations 194408 Decom tellurium Screep Done None
Other Operations 199938 VipsScaler broken for MediaWiki production (/usr/bin/vips: No such file) Screep Done None
Other Operations 135338 On Trusty and Jessie PHP yields: PHP Deprecated: Comments starting with '#' are deprecated in /etc/php5/cli/conf.d/20-xhprof.ini on line 2 In-Scope Done None
Other Operations 199921 Relabel labnet1004.eqiad.wmnet as cloudnet1004.eqiad.wmnet Screep Done None
Other Operations 202784 add icinga1001 to allowed hosts for AQL SMS gateway Screep Done None
Other Operations 203607 Ensure Jenkins mail configuration supports outbound smtp server failover Elaborated Done None
Other Operations 204980 add onimisionipe to restricted group Screep Done None
Other Operations 115899 Move scap target configuration to etcd In-Scope Done None
Other Operations 203840 Add which ldap groups can login on netbox login form Screep Done None
Other Operations 205540 Upgrade to OTRS version 5.0.30 Screep Done None
Other Operations 203489 Onboard gtirloni to WMF Screep Done None
Other Operations 203222 Mailing list for Wiki Indaba Steering Committee Screep Done None
Other Operations 201196 analytics-privatedata-users access for Dario Rossi (username drossi) Screep Done None
Other Operations 193093 delete "wmfproduct" list Screep Done None
Other Operations 202486 Requesting access to restricted production access and analytics-privatedata-users for Kalliope Tsouroupidou Screep Done None
Other Operations 204532 Request removal of puppet3-diffs VPS project Elaborated Done None
Other Operations 200722 releng/mediawiki-phpcs-dryrun fails to upload to docker-registry.wikimedia.org Screep Done None
Other Operations 202362 Requesting access to restricted production access and analytics-privatedata-users for Samuel Guebo Screep Done None
Other Operations 202439 check cabling/config for payments1004 DRAC interface Screep Done None
Other Operations 196664 rack/setup/install authdns2001.wikimedia.org In-Scope Done None
Other Operations 196485 WDQS diskspace is low In-Scope Done None
Other Operations 173492 Tune Varnishkafka delivery errors to be more sensitive In-Scope Done None
Other Operations 193420 Decommission hafnium In-Scope Done None
Other Operations 165105 Some requests for DOIs are failing or very slow; if we have a DOI and the request is taking too long, just use CrossRef data instead. In-Scope Done None
Other Operations 202013 eqiad: (3) VM request for internal analytics web sites Screep Done None
Other Operations 177958 Decommission ocg1001-3 In-Scope Done None
Other Operations 182015 Decommission Vanadium In-Scope Done None
Other Operations 203121 Update Debian package of Blubber (0.5.0-1) Screep Done None
Other Operations 199816 Sunset Watchmouse's status.wikimedia.org Screep Done None
Other Operations 202617 Update terms "Labs" and "Operations" in L3 Screep Done None
Other Operations 200666 Please add php-imagick and php-redis packages to apt.wikimedia.org thirdparty/php72 Screep Done None
Other Operations 200954 Lost access to archiva Screep Done None
Other Operations 204173 apply hostname label to cumin2001 / wmf6407 and update visible label field in racktables Screep Done None
Other Operations 202657 request to add imarlier to perf-roots Screep Done None
Other Operations 201440 Decommission labtestnet2001.codfw.wmnet Screep Done None
Other Operations 201667 Requesting access to restricted production access and analytics-privatedata-users for Patrick Earley Screep Done None
Other Operations 196483 rack/setup/install graphite2003 In-Scope Done None
Other Operations 175361 Upgrade mx1001/mx2001 to stretch In-Scope Done None
Other Operations 204491 Heating alerts / memory errors on mw1254 Screep Done None
Other Operations 204772 elastic200[456] suddenly offlined Screep Done None
Other Operations 200660 Upload python-pykube deb to apt.wikimedia.org Screep Done None
Other Operations 205366 mcelog is deprecated in kernel >= 4.12 Screep Done None
Other Operations 196916 Phabricator outbound email seems to have a SPOF of mx1001 In-Scope Done None
Other Operations 198785 Degraded RAID on cp3043 Screep Done None
Other Operations 167377 Decommission cp4011, cp4012, cp4019, cp4020 In-Scope Done None
Other Operations 195289 Add Addshore & possibly other WMDE devs/deployers to the wikidata icinga contact list In-Scope Done None
Other Operations 202397 ms-be2020 crashed Screep Done None
Other Operations 202473 Create releasers-wikidiff2 group, split from releasers-mediawiki Screep Done None
Other Operations 201148 dbproxy1006 iDRAC IP conflict Screep Done None
Other Operations 199937 Thumbs from VipsScaler fail (HTTP 500; 10.2.1.21 refuses port 80) Screep Done None
Other Operations 191921 mwscript rebuildLocalisationCache.php takes 40 minutes on HHVM (rather than ~5 on PHP 5) In-Scope Done None
Other Operations 202521 Onboarding Balazs Pocze Screep Done None
Other Operations 201737 docker-registry is returning HTTP 403 Forbidden for all requests Screep Done None
Other Operations 202779 add SSDs to wdqs100[45] Screep Done None
Other Operations 169680 NFS on dataset1001 overloaded, high load on the hosts that mount it In-Scope Done None
Other Operations 191364 decom spare server osmium/wmf4546 In-Scope Done None
Other Operations 198407 Degraded RAID on labstore1007 In-Scope Done None
Other Operations 205034 apply hostname labels to an-coord1001/wmf7621 Screep Done None
Other Operations 204564 cloudvps: toolserver-legacy project trusty deprecation Screep Done None
Other Operations 152724 Current state and next steps for RESTBase storage In-Scope Done None
Other Operations 200092 Degraded RAID on ms-be1016 Screep Done None
Other Operations 198049 Investigate possible outage on wikidata on 25th June - 04:13AM UTC - 05:27AM UTC In-Scope Done None
Other Operations 202478 php7.2-cli in thirdparty/php72 isn't installable due to libargon2-1 dependency Screep Done None
Other Operations 203966 Open Foundation West Africa (OFWA) mailing list Screep Done None
Other Operations 133744 Epic: switch Maps to production status In-Scope Done None
Other Operations 198058 Integrate jessie 8.11 point update In-Scope Done None
Other Operations 162850 CPU throttling on DELL PowerEdge R320 In-Scope Done None
Other Operations 203776 Successfully switch backend traffic (MediaWiki, Swift, RESTBase, Parsoid and services) to be served from codfw Elaborated Done None
Other Operations 174431 Upgrade mw* servers to Debian Stretch (using HHVM) In-Scope Done None
Other Operations 199757 Unable to access SWAP notebooks using LDAP Screep Done None
Other Operations 202247 Password reset request for wikimedia-nd mailing list Screep Done None
Other Operations 203626 deploy1001 can't talk to memcached, breaking invalidation of RL localization cache Elaborated Done None
Other Operations 187467 Decommission mw2017 In-Scope Done None
Other Operations 203055 en.planet hasn't updated since July 25 Screep Done None
Other Operations 196873 ms-be1036 in power off status, not responsive to power on commands In-Scope Done None
Other Operations 184832 Decommission labsdb1001 and labsdb1003 In-Scope Done None
Other Operations 48254 ircecho should support nickserv registration Screep Done None
Other Operations 204812 mc1021 boot failure Screep Done None
Other Operations 203182 Requesting access to EventLogging in Hive (analytics-privatedata-users) for Cicalese Screep Done None
Other Operations 170150 Evaluate Grafana's LDAP group options and deprecate grafana-admin if possible In-Scope Done None
Other Operations 203108 Create keyholder gerrit repo Screep Done None
Other Operations 204302 db1062 management interface busy (no sessions allowed) Screep Done None
Other Operations 201350 Access to dumps servers Screep Done None
Other Operations 201367 rack/setup/add to spares tracking 2 dual cpu misc system Screep Done None
Other Operations 202704 Add a few more PHP 7.2 packages for Toolforge in thirdparty/php72 Screep Done None
Other Operations 197146 Disk predictive failure on db2052 In-Scope Done None
Other Operations 199203 Relabel labvirt1022.eqiad.wmnet as cloudvirt1022.eqiad.wmnet Screep Done None
Other Operations 200307 Jenkins builds using autopkgtest stuck at "Not enough random bytes available" Screep Done None
Other Operations 201913 new ssh key for daniel Screep Done None
Other Operations 201991 Broken memory on elastic1029 Screep Done None
Other Operations 205661 change my email address in the techcom alias Screep Done None
Other Operations 202063 Requesting access to view EventLogging data for Tim WMDE Screep Done None
Other Operations 192103 Decommission notebook1001 In-Scope Done None
Other Operations 202503 Phabricator: Allow aklapper to delete personal Herald filter rules Screep Done None
Other Operations 202563 Access to restbase servers (including sudo) for Imarlier Screep Done None
Other Operations 194060 decommission dataset1001, ms1001 Screep Done None
Other Operations 202559 Allow ganeti instance inside of the Analytics VLAN; move analytics-tool* to it and change IPs. Screep Done None
Other Operations 202623 Reboots of dumps/snapshot hosts for L1TF/microcode updates Screep Done None
Other Operations 201494 Fix permissions of /srv/mediawiki-staging/private/README_BEFORE_MODIFYING_ANYTHING on mwdeploy1001 Screep Done None
Other Operations 201856 Subscribe user mepps to security@wikimedia.org Screep Done None
Other Operations 169020 Decommission cp400[1-4] In-Scope Done None
Other Operations 56515 Apply editing rate limits for all users In-Scope Done None
Other Operations 202650 Please add aaron to perf-team Screep Done None
Other Operations 198042 WDQS timeout on the public eqiad cluster In-Scope Done None
Other Operations 200763 Decom/reclaim terbium Screep Done None
Other Operations 203494 Requesting access to Root for Giovanni Tirloni Screep Done None
Other Operations 202780 add SSDs to wdqs1003 Screep Done None
Other Operations 198728 Degraded RAID on db2056 Screep Done None
Other Operations 204156 setup/install cumin2001.eqiad.wmnet Screep Done None
Other Operations 182497 Update log config for scb* boxes, to deal with ORES verbose logging In-Scope Done None
Other Operations 203561 Remove user "albe" from the wmde LDAP group Screep Done None
Other Operations 200203 labvirt1003 raid warning Screep Done None
Other Operations 204493 db1061 management interface busy (no sessions allowed) Screep Done None
Other Operations 199524 Relabel labnet1003.eqiad.wmnet as cloudnet1003.eqiad.wmnet Screep Done None
Other Operations 201187 Thumbnails don't seem to be being created/saved for id_internalwikimedia Screep Done None
Other Operations 155683 Close https://lists.wikimedia.org/mailman/listinfo/cep and keep the archive for now In-Scope Done None
Other Operations 184522 To purchase for next esams visit In-Scope Done None
Other Operations 196920 Add email queueing/failover to services currently using mail_smarthost[0] In-Scope Done None
Other Operations 201355 bast1002 - hardware (memory) issue Screep Done None
Other Operations 203465 Site: 4 VM request for ORES poolcounter Screep Done None
Other Operations 204393 Add admins to mailing list engineering@ Screep Done None
Other Operations 202166 Check/replace PEM2 on cr2-codfw Screep Done None
Other Operations 195370 Deploy FileExporter and FileImporter to group0 In-Scope Done 8.0
Other Operations 200330 Cannot SSH to stat1004 Screep Done None
Other Operations 202120 mjolnir-kafka-bulk-daemon failed on all elastic / eqiad nodes Screep Done None
Other Operations 201454 update ssh keys for amire80 - August 2018 Screep Done None
Other Operations 200895 eqiad: (1) VM request for Archiva Screep Done None
Other Operations 202708 Onboarding Mathew Onipe Screep Done None
Other Operations 199132 Relabel labvirt1021.eqiad.wmnet as cloudvirt1021.eqiad.wmnet Screep Done None
Other Operations 191352 decom zinc/WMF3298 In-Scope Done None
Other Operations 199801 Update wikidiff2 library on the WMF production cluster to v1.7.2 Screep Done 1.0
Other Operations 200799 Add email addresses for new techcom members to techcom@wikimedia.org Screep Done None
Other Operations 201757 Degraded RAID on db2033 Screep Done None
Other Operations 203271 Update Debian Package for Scap to 3.8.5-1 Screep Done None
Other Operations 203546 Alert when elasticsearch has shards larger than a maximum size Screep Done None
Other Operations 196175 decom/reclaim tin In-Scope Done None
Other Operations 203404 Degraded RAID on elastic2012 Screep Done None
Other Operations 146841 Reach out to Google about @yahoo.com emails not reaching gmail inboxes (when sent to mailing lists) In-Scope Done None
Other Operations 197450 test.wp is using test2.wp's message cache In-Scope Done None
Other Operations 163438 VisualEditor broken on wikitech when codfw is primary: "Error loading data from server: apierror-visualeditor-docserver-http: HTTP 500." Screep Done None
Other Operations 198805 Update wikitech docs to use new mwmaint1001 instead of terbium Elaborated Done None
Other Operations 199318 tegmen is down Screep Done None
Other Operations 199233 Give access to graphite and grafana-admin to Aleksey Bekh-Ivanov (WMDE) Screep Done None
Other Operations 190333 Backport libvpx 1.7.0, ffmpeg packages for VP9 -row-mt option In-Scope Done None
Other Operations 191153 decom bast1001 In-Scope Done None
Other Operations 182016 Decommission server zinc In-Scope Done None
Other Operations 199594 Exception "Job queue is read-only" Screep Done None
Other Operations 196787 Deactivate Chad's Racktables account In-Scope Done None
Other Operations 199967 Add Lea Voget (WMDE) & Bmueller to the WMDE LDAP group Screep Done None
Other Operations 192092 setup replacements for maintenance_server (terbium, wasat) on Stretch In-Scope Done None
Other Operations 127825 Re-add intel-microcode In-Scope Done None
Other Operations 183814 Degraded RAID on bast3002 In-Scope Done None
Other Operations 174720 letsencrypt::cert::integrated and non-http servers In-Scope Done None
Other Operations 191348 decommission uranium/WMF3128 In-Scope Done None
Other Operations 200080 Relabel labcontrol1003.wikimedia.org as cloudcontrol1003.wikimedia.org Screep Done None
Other Operations 203290 syncing Ubuntu mirror fail Screep Done None
Other Operations 201986 cassandra-a instance on aqs1007 is not starting Screep Done None
Other Operations 204604 Add "do not use this server" login message to non active mwmaint* server Screep Done None
Other Operations 201199 analytics-privatedata-users access for Flavia Salutari Screep Done None
Other Operations 197630 decommission samarium.frack.eqiad.wmnet In-Scope Done None
Other Operations 199782 Relabel labcontrol1004.wikimedia.org as cloudcontrol1004.wikimedia.org Screep Done None
Other Operations 204389 Update wasat/mwmaint2001 docs on Wikitech Screep Done None
Other Operations 138866 Update & standardize Platform-specific_documentation for HP servers In-Scope Done None
Other Operations 202910 add performance team members to webserver_misc_static servers to maintain sitemaps Screep Done None
Other Operations 181763 Decommission niobium In-Scope Done None
Other Operations 201971 Shorten logstash retention temporarily Screep Done None
Other Operations 199283 Update Debian Package for Scap3 to 3.8.4-1 Screep Done None
Other Operations 196252 Labservices1001 crashing, probable overheating In-Scope Done None
Other Operations 198051 Enable async logging on Wikidata Query Service In-Scope Done None
Other Operations 200888 labvirt1009 has high CPU, disk I/O and skyrocketed load Screep Done None
Other Operations 201470 Add contint-roots to releases{1,2}001 Screep Done None
Other Operations 194835 mw2182 crash In-Scope Done None
Other Operations 201341 rack/setup/install cloudservices1004.wikimedia.org Screep Done None
Other Operations 204004 Rename labvirt1019 and cloudvirt1020 to cloudvirt1019 and cloudvirt1020 Screep Done None
Other Operations 190184 Netbox: setup backups In-Scope Done None
Other Operations 199966 Give Lea Voget (WMDE) "grafana-admin" LDAP group access Screep Done None
Other Operations 202301 Release and deploy wikidiff2 v1.7.3 Screep Done None
Other Operations 202658 request to add phedenskog to perf-roots Screep Done None
Other Operations 197021 decommission snapshot1001 In-Scope Done None
Other Operations 198887 Request for languageconverter@lists.wikimedia.org Screep Done None
Other Operations 196417 Rack/Setup frbast2001.frack.codfw.wmnet In-Scope Done None
Other Operations 202100 Intermittent git-fat failure during deploy Screep Done None
Other Operations 196901 Replace memory bank on scb1002 In-Scope Done None
Other Operations 133091 Highest SSTables / read thresholds In-Scope Done None
Other Operations 199493 Add BPirkle to wmf ldap group Screep Done None
Other Operations 187194 zotero translation server: code stewardship request In-Scope Done None
Other Operations 199353 kafka eqiad cluster keeps crashing Screep Done None
Other Operations 168559 decom silver (was silver has trouble rebooting) Screep Done None
Other Operations 199489 Helm test failing for CI namespace Screep Done None
Other Operations 202778 add ssds to wdqs2003 Screep Done None
Other Operations 201929 Mailing list for Wikimedians of Tamazight User Group Screep Done None
Other Operations 202777 add SSDs to wdqs200[12] Screep Done None
Other Operations 191351 decom vanadium/WMF3291 In-Scope Done None
Other Operations 188377 Import some Analytics git puppet submodules to operations/puppet In-Scope Done 8.0
Other Operations 204960 add onimisionipe to maps-admin Screep Done None
Other Operations 201816 Onboarding Effie Mouzeli Screep Done None
Other Operations 179371 Move deployment-prep redis instances to stretch In-Scope Done None
Other Operations 201761 Degraded RAID on db2039 Screep Done None
Other Operations 196034 Define scap::sources in a way that is shared between prod and beta In-Scope Open None
Other Operations 158837 Consolidate performance website and related software In-Scope Open None
Other Operations 203792 Create a mailing list for Wiki Loves Love Screep Open None
Other Operations 150771 Secondary production Jenkins for CI In-Scope Open None
Other Operations 126989 MediaWiki logging & encryption In-Scope Open None
Other Operations 161004 Remove disabled users from internal mailing lists In-Scope Open None
Other Operations 159480 Decommission bast3001 In-Scope Open None
Other Operations 175710 Add profiling for Varnish and VCL In-Scope Open None
Other Operations 194997 Track more detailed disk usage on maps servers In-Scope Open None
Other Operations 76203 Make ircecho run as its own user In-Scope Open None
Other Operations 204479 Heating alerts on kafka1014 Screep Open None
Other Operations 101912 Network segmentation for WMF servers In-Scope Open None
Other Operations 151304 tmpreaper possible race condition In-Scope Open None
Other Operations 150823 Puppet CA rollover In-Scope Open None
Other Operations 136603 Update limit.sh to support systemd-based cgroup management In-Scope Open None
Other Operations 205694 I get a "403 Forbidden" error when subscribing to a list Screep Open None
Other Operations 177196 Port non-deprecated Diamond collectors to Prometheus In-Scope Open None
Other Operations 204907 Scap is checking canary servers in dormant instead of active-dc Screep Open None
Other Operations 191362 decom promethium/WMF3571 In-Scope Open None
Other Operations 140879 503 error raises again while trying to load a Wikidata page In-Scope Open None
Other Operations 163033 Create grafana dashboard for video scaler job runners In-Scope Open None
Other Operations 170152 mc2023 / mc2025 fail to mount root partition within 90 seconds using Linux 4.9 In-Scope Open None
Other Operations 140141 Install mscorefonts on scaling servers for SVG rendering In-Scope Open None
Other Operations 191388 Puppet: tracking catalogs that change at every run In-Scope Open None
Other Operations 181621 What is causing ORES celery workers to suddenly require more CPU? In-Scope Open None
Other Operations 126083 overhaul labstore setup [tracking] In-Scope Open None
Other Operations 166081 rack/setup/install conf1004-conf1006 In-Scope Open None
Other Operations 198790 Relabel hooft to bast3002 Screep Open None
Other Operations 174959 swift-recon-cron on ms-be203[34]: [Errno 17] File exists: '/var/lock/swift-recon-object-cron' In-Scope Open None
Other Operations 94215 decommission cp3001 & cp3002 In-Scope Open None
Other Operations 134875 udpmxircecho spam/not working if unable to connect to irc server In-Scope Open None
Other Operations 203786 Mcrouter periodically reports soft TKOs for mc[1,2]035 leading to MW Memcached exceptions Screep Open None
Other Operations 138821 extend existing graphite whisper files retention to five years In-Scope Open None
Other Operations 168967 Upload shiny-server .deb to our Jessie apt repository In-Scope Open None
Other Operations 195553 Puppet class systemd needs to throw a more useful error In-Scope Open None
Other Operations 141520 "MediaWiki exceptions and fatals per minute" alarm is too slow (half an hour delay!) In-Scope Open None
Other Operations 161598 Monitor HHVM bytecode cache depletion on mediawiki app servers In-Scope Open None
Other Operations 196968 Re-organize the apache configuration for MediaWiki in puppet In-Scope Open None
Other Operations 200210 Decom graphite2002 Screep Open None
Other Operations 183236 After reimage Puppet order: sudo command failed In-Scope Open None
Other Operations 159242 Segmentation fault creating thumbnail In-Scope Open None
Other Operations 142821 Synchronise groups defined in data.yaml to LDAP In-Scope Open None
Other Operations 83729 Fix monitoring of poolcounter service In-Scope Open None
Other Operations 204088 Prometheus resources in deployment-prep to create grafana graphs of EventLogging Screep Open None
Other Operations 124179 Improve access to and control over incident and metrics monitoring infrastructure In-Scope Open None
Other Operations 199413 Systemd restart loop of timer filled the disk on tegmen Screep Open None
Other Operations 154627 Production error message (when servers are down) points users to donate link which is likely to produce the same error message In-Scope Open None
Other Operations 84700 Setup management switch in OE12 In-Scope Open None
Other Operations 186416 Allow selecting which images to build In-Scope Open None
Other Operations 152445 Move prometheus entry point off port 80 In-Scope Open None
Other Operations 184655 logstash group1 dashboard incorrectly shows testwikidatawiki In-Scope Open None
Other Operations 163996 Icinga check for ipv6 host reachability In-Scope Open None
Other Operations 188601 Gain visibility into httpd mod_proxy actions In-Scope Open None
Other Operations 115757 document debian packaging guidelines In-Scope Open None
Other Operations 128821 reclaim and return all cisco servers In-Scope Open None
Other Operations 153816 apache::static_site is not working In-Scope Open None
Other Operations 200960 Logstash packet loss Screep Open None
Other Operations 161834 Undo special tools-home and tools-project share definitions for NFS In-Scope Open None
Other Operations 177914 Switch labstore servers to default SSH configuration In-Scope Open None
Other Operations 190111 VirtualHost for mod_status breaks debugging Apache/MediaWiki from localhost In-Scope Open None
Other Operations 179395 Cluster puppet variable and ganglia decommission In-Scope Open None
Other Operations 194249 kafka1023 correctable memory errors In-Scope Open None
Other Operations 130883 decom cp3011-22 (12 machines) In-Scope Open None
Other Operations 138799 Create a simple puppet role for setting up a singlenode kubernetes install In-Scope Open None
Other Operations 161920 logrotate for ruthenium In-Scope Open None
Other Operations 164238 move icinga contacts file to public repo In-Scope Open None
Other Operations 171482 Programmatic generation of grafana dashboards In-Scope Open None
Other Operations 136094 Race condition in setting net.netfilter.nf_conntrack_tcp_timeout_time_wait In-Scope Open None
Other Operations 157306 Fix config file handling for /etc/hhvm/php.ini In-Scope Open None
Other Operations 117508 Make ops-l a list for humans again (no cheating) In-Scope Open None
Other Operations 203883 Implement MTA-STS Screep Open None
Other Operations 181200 Use "Charter" as preferred typeface on Electron In-Scope Open None
Other Operations 186288 replace all Ubuntu (trusty) hosts in production with Debian In-Scope Open None
Other Operations 182331 [Epic] Deploy ORES in kubernetes cluster In-Scope Open None
Other Operations 151317 stat user crontab on stat hosts for old file removal In-Scope Open None
Other Operations 156143 High CPU usage from swift-proxy on frontend machines In-Scope Open None
Other Operations 203092 Create Graphoid .pipeline files Screep Open None
Other Operations 149643 Review Icinga alarms with disabled notifications In-Scope Open None
Other Operations 126295 Spike: What do we have to package to run the Programs and Events dashboard on production? In-Scope Open None
Other Operations 155209 Increase $wgHTTPImportTimeout to a higher value on WMF wikis In-Scope Open None
Other Operations 84279 Admin module should allow group management of system users In-Scope Open None
Other Operations 123918 'swift' user/group IDs should be consistent across the fleet In-Scope Open None
Other Operations 205577 Puppet agent takes a long time to finish when adding IPv6 addresses Screep Open None
Other Operations 185189 scap sudo violation on first puppet run In-Scope Open None
Other Operations 165631 move gerrit.wm.org SSH service to private/behind LVS like phab-vcs In-Scope Open None
Other Operations 161003 Cross-check disabled accounts from corp LDAP against data.yaml In-Scope Open None
Other Operations 163336 kube-proxy pulls in docker and starts service even when it isn't needed In-Scope Open None
Other Operations 191491 Adjust bandwidth/connection limits, memory settings on labstore1006,7 as appropriate In-Scope Open None
Other Operations 133656 Have a paging check for Nova API accessible In-Scope Open None
Other Operations 185215 Puppet compiler failure to lookup some keys In-Scope Open None
Other Operations 163362 audit all codfw pdu tower draws In-Scope Open None
Other Operations 161835 Convert labstore cluster configuration to hiera and profiles In-Scope Open None
Other Operations 149804 Review of ferm services without srange In-Scope Open None
Other Operations 192751 Please upload large file to Wikimedia Commons In-Scope Open None
Other Operations 122127 Translation of namespaces for Gilaki In-Scope Open None
Other Operations 154915 Get rid of "import realm.pp" in manifests/site.pp In-Scope Open None
Other Operations 187456 Decommission labstore100[12] and their disk shelves In-Scope Open None
Other Operations 104671 Rename 'restricted' group? In-Scope Open None
Other Operations 109090 Investigate the need for master only (non data nodes) in our ES cluster In-Scope Open None
Other Operations 98984 Check power supply balance settings on cp3030+ In-Scope Open None
Other Operations 205037 releases servers: set rsync direction based on active dc, add warning motd on inactive server Screep Open None
Other Operations 200690 Wrong umask when deploying from screen Screep Open None
Other Operations 164993 archiva artifact links point to 127.0.0.1 In-Scope Open None
Other Operations 186311 wikitech-l is mangling my PGP/MIME emails, causing signature validation to fail In-Scope Open None
Other Operations 200022 2018 data center switchover: Move all the things over to codfw Screep Open None
Other Operations 147872 Rename rhodium to puppetmaster1003 In-Scope Open None
Other Operations 186153 outdated DjVu file page thumbnail in cache In-Scope Open None
Other Operations 161145 Fix the general problem of randomly-bad puppet agent cron timings within redundant clusters In-Scope Open None
Other Operations 198939 Decommission servermon Screep Open None
Other Operations 190455 Logstash no longer captures DB queries in debug mode In-Scope Open None
Other Operations 190318 remove puppet_major_version and puppetdb_major_version variables. clean up puppet master/db hieradata In-Scope Open None
Other Operations 170817 Upgrade Thumbor servers to Stretch In-Scope Open None
Other Operations 201611 Deploy translation-server-v2 Elaborated Open None
Other Operations 108985 Monitor MediaWiki sessions In-Scope Open None
Other Operations 186918 prometheus: ganglia-gen and outdated Ganglia:cluster resource name In-Scope Open None
Other Operations 164819 reprepro: Support for buildinfo files / dbgsym packages In-Scope Open None
Other Operations 193766 Ship host syslogs to ELK In-Scope Open None
Other Operations 123237 Provide production jessie image with node 4.2; use this for service-runner build command In-Scope Open None
Other Operations 147923 Extract metrics from logs In-Scope Open None
Other Operations 154665 Look into behaviour of /etc/exim4/update-exim4.conf.conf related to updates In-Scope Open None
Other Operations 200023 2018 data center switchover: Move all the things back to eqiad Screep Open None
Other Operations 88730 Nutcracker needs to automatically recover from MC failure - rebalancing issues In-Scope Open None
Other Operations 151047 Integrate Yubikey into data.yaml In-Scope Open None
Other Operations 180330 Add CI to all operations/* repositories and archive obsolete ones In-Scope Open None
Other Operations 135991 Automated service restarts for common low-level system services In-Scope Open None
Other Operations 178877 operations/software repo: flake8 check In-Scope Open None
Other Operations 190693 Extend dpkg Icinga check to also check for inconsistent apt state In-Scope Open None
Other Operations 76306 Set warning thresholds for average cluster utilization In-Scope Open None
Other Operations 150396 Phabricator leaving old files in /tmp In-Scope Open None
Other Operations 148017 lvs2002 repeated usb connect/disconnect message In-Scope Open None
Other Operations 140813 Protect sensitive user-related information with a UserData / auth / session service In-Scope Open None
Other Operations 157038 Make it possible to run the mediawiki testsuite against a staging repo of apt.wikimedia.org In-Scope Open None
Other Operations 147040 Two recently uploaded files have disappeared (404) In-Scope Open None
Other Operations 159536 Puppet constantly trying to stop the already stopped puppetmaster process on Trusty In-Scope Open None
Other Operations 182759 Add Prometheus exporter to Jenkins instances In-Scope Open None
Other Operations 109606 Re-evaluate Limesurvey In-Scope Open None
Other Operations 153416 docker-engine pulled into our repositories only keeps the latest version In-Scope Open None
Other Operations 173721 Track down the source of periodic increases in requests to swift eqiad In-Scope Open None
Other Operations 116742 Track amount of package updates on systems In-Scope Open None
Other Operations 184461 Discourse migration from wmflabs to production In-Scope Open None
Other Operations 203434 Decom mw2213 Screep Open None
Other Operations 150872 Replace OCG in collection extension with Electron In-Scope Open None
Other Operations 113785 Make the Shinken IRC alert and icinga-wm bots use colors In-Scope Open None
Other Operations 129963 Update memcached package and configuration options In-Scope Open None
Other Operations 199427 Separate dev Change-Prop from production Kafka cluster Screep Open None
Other Operations 141038 implement icinga paging for non-ops teams In-Scope Open None
Other Operations 149885 Investigate Swift as a storage backend for maps tiles In-Scope Open None
Other Operations 204857 notebook1003 failed network mount on boot Screep Open None
Other Operations 204047 investigate tilerator crash on maps eqiad Screep Open None
Other Operations 175213 2017/18 Annual Plan Program 8: Multi-datacenter support, Q2 goals In-Scope Open None
Other Operations 163393 Determine appropriate proxy_read_timeout setting for Tools Proxy In-Scope Open None
Other Operations 103886 Translation cache exhaustion caused by changes to PHP code in file scope In-Scope Open None
Other Operations 176666 Qualtrics cannot send email to wikimedia.org addresses In-Scope Open None
Other Operations 41785 Create a Cloud VPS SMTP smarthost In-Scope Open None
Other Operations 153068 Consider mounting labs NFS labstore1003.eqiad.wmnet:/scratch for server-side uploads In-Scope Open None
Other Operations 178628 Improve puppet alerting In-Scope Open None
Other Operations 160060 Icinga check for sysctl settings In-Scope Open None
Other Operations 193655 rack/setup/install cloudstore1008 & cloudstore1009 In-Scope Open None
Other Operations 140270 Determine a core set or a checklist of permissions for deployment purpose In-Scope Open None
Other Operations 191648 uwsgi::app sorts config keys, but the .ini file behavior depends on order In-Scope Open None
Other Operations 175876 document all scs connections In-Scope Open None
Other Operations 197873 how to structure wiki pages for Icinga reaction play books In-Scope Open None
Other Operations 188985 https://meta.wikimedia.org/wiki/Special:Contact/Stewards is being abused by spammers In-Scope Open None
Other Operations 175206 2017/18 Annual Plan Program 8: Multi-datacenter support In-Scope Open None
Other Operations 197084 Report problems found in server's IPMI SEL In-Scope Open None
Other Operations 121610 system users with UIDs > 500 In-Scope Open None
Other Operations 193155 IPMI Audit 2018-04 In-Scope Open None
Other Operations 205618 rsync puppet module doesn't delete removed config Screep Open None
Other Operations 119718 Make it easier to ban misbehaving dashboards from graphite In-Scope Open None
Other Operations 203959 SRE quarterly goal: allow MediaWiki requests to be served by PHP7 alongside HHVM Screep Open None
Other Operations 119846 Redirect revisions from svn.wikimedia.org to https://phabricator.wikimedia.org/rSVN In-Scope Open None
Other Operations 120585 Make l10nupdate user a system user In-Scope Open None
Other Operations 168767 Monitor PostgreSQL connection slots In-Scope Open None
Other Operations 116580 monitor postgresql replication status In-Scope Open None
Other Operations 159750 E-mail for people in different OIT LDAP object unit In-Scope Open None
Other Operations 199125 rack/setup/install cloudvirt102[34] Screep Open None
Other Operations 148843 GPU upgrade for stats machine In-Scope Open None
Other Operations 184462 Serve one production service via Kubernetes In-Scope Open None
Other Operations 176153 Create affcom-staff email account In-Scope Open None
Other Operations 156955 Standardizing our partman recipes In-Scope Open None
Other Operations 95052 Make ircecho much better In-Scope Open None
Other Operations 197470 find a way to systematically update the deployment server name across all repos In-Scope Open None
Other Operations 131748 Refresh the appservers puppet code/configs In-Scope Open None
Other Operations 204845 logstash-beta.wmflab throws multiple "Error: Could not locate that visualization" Screep Open None
Other Operations 196698 rack/setup/install auth1002 In-Scope Open None
Other Operations 194855 Degraded RAID on cloudvirt1020 In-Scope Open None
Other Operations 163354 Find a way to verify mediawiki-config IPs ahead of datacenter switchovers In-Scope Open None
Other Operations 144933 Cleanup debconf handling in mailman puppet setup In-Scope Open None
Other Operations 196474 Externalize tile storage for maps In-Scope Open None
Other Operations 194558 Enable CAPTCHA on mailman instances In-Scope Open None
Other Operations 135128 Turn on etcd TLS for intra-cluster communications In-Scope Open None
Other Operations 95801 Allow customizing the alert message from graphite In-Scope Open None
Other Operations 97368 Investigate more efficient memcached solution for CacheAwarePropertyInfoStore Screep Open None
Other Operations 192547 Improve remote IPMI monitoring In-Scope Open None
Other Operations 185298 xfs_db blocked / timeout on ms-be2023 In-Scope Open None
Other Operations 193733 Move dispatching of wikidata to a dedicated node In-Scope Open None
Other Operations 163068 More missing 'original' files on Commons In-Scope Open None
Other Operations 184066 Procure and install new PDUs In-Scope Open None
Other Operations 104774 Publishing translations for central notice banners fails In-Scope Open 2.0
Other Operations 170740 PuppetDB misbehaving on 2017-07-15 In-Scope Open None
Other Operations 203625 mwdebug1001 and mwdebug1002 are reliably the last two hosts to finish scap-cdb-rebuild Screep Open None
Other Operations 188453 Google Search Console access for Search Platform team In-Scope Open None
Other Operations 129188 mw2212 unresponsive In-Scope Open None
Other Operations 141959 Moving network::external to hiera broke much of labs In-Scope Open None
Other Operations 198256 RFC: Modern Event Platform - Choose Schema Tech Screep Open None
Other Operations 194669 Provide a means to mass discard/reject subscription requests on Wikimedia mailing lists In-Scope Open None
Other Operations 167035 stretch acct monthly cron will spam when /var/log/wtmp.1 doesn't exist In-Scope Open None
Other Operations 199228 Define an SLO for Wikidata Query Service public endpoint and communicate it Screep Open None
Other Operations 110169 Monitor redis memory/disk usage In-Scope Open None
Other Operations 199198 Some swift filesystems reporting negative disk usage Screep Open None
Other Operations 138685 notebook1001 shown as DOWN in icinga, due to firewall rules In-Scope Open None
Other Operations 150871 [EPIC] (Proposal) Replicate core OCG features and sunset OCG service In-Scope Open None
Other Operations 182832 Apache on phab1001 is gradually leaking worker processes which are stuck in "Gracefully finishing" state In-Scope Open None
Other Operations 176774 Reimage cobalt as stretch In-Scope Open None
Other Operations 193072 TTS server deployment strategy In-Scope Open None
Other Operations 132216 Setting up bulk proxies pointing to a multiwiki mediawiki-vagrant setup running on a labs vm In-Scope Open None
Other Operations 184061 SRE 2017-18 Q3 goal Cleanup esams and refresh servers and infrastructure (tracking) In-Scope Open None
Other Operations 113792 Change LDAP cn to something more useful (was Rename "Dzahn" to "Daniel Zahn" in Gerrit) In-Scope Open None
Other Operations 88997 Improve graphite failover In-Scope Open None
Other Operations 194966 disk usage increase on maps servers In-Scope Open None
Other Operations 181803 Stop storing Mailman passwords in plain text Screep Open None
Other Operations 140594 svn.wikimedia.org redirects to Diffusion main page, hence hard to find e.g. "flexbisonparse" In-Scope Open None
Other Operations 165618 Audit / document reasons for not enabling HT? In-Scope Open None
Other Operations 185644 Switch phabricator from using apache to nginx In-Scope Open None
Other Operations 200103 requesting additional production ssh key for jmorgan Screep Open None
Other Operations 202435 Create mailing list for Bureaucrat of zh.wikipedia Screep Open None
Other Operations 159524 backup space is used unwisely In-Scope Open None
Other Operations 123560 investigate rsync between dcs with encryption In-Scope Open None
Other Operations 142827 Enforce reference to Phabricator task for all commits to modules/admin/data/data.yaml In-Scope Open None
Other Operations 122825 Service Ownership and Maintenance In-Scope Open None
Other Operations 170628 Jessie rsvg/cairo can't render specific SVG file on Commons In-Scope Open None
Other Operations 142205 use granularity (g=) restrictions for wikimedia.org fundraising DKIM records In-Scope Open None
Other Operations 205526 Register and identify icinga-wm Screep Open None
Other Operations 179078 mpt raid controller not detected as fact on maps-test2* In-Scope Open None
Other Operations 143556 Setting up grafana should also setup Anonymous read-only access for the default org In-Scope Open None
Other Operations 120532 Use user-specific passwords for accessing EventLogging database In-Scope Open None
Other Operations 181546 Let the ORES application set log severity, not uWSGI In-Scope Open None
Other Operations 195050 Refactor pipeline build step to be more isolated/secure/scalable In-Scope Open None
Other Operations 186073 Rack/setup frmon1001 In-Scope Open None
Other Operations 124413 confctl should provide tags information after writing data In-Scope Open None
Other Operations 55457 setup a DB backed parser cache In-Scope Open None
Other Operations 194036 mw1230 sdb "Raw_Read_Error_Rate" SMART In-Scope Open None
Other Operations 205736 Requesting access to stats, analytics-search-users, statistics-privatedata-users for Chelsy Xie Screep Open None
Other Operations 187257 puppetdb4: systemd config review In-Scope Open None
Other Operations 160071 Add slabinfo prometheus exporter In-Scope Open None
Other Operations 186069 Icinga: page in case all MediaWiki are throwing 5xx In-Scope Open None
Other Operations 204110 Add favicon to icinga and tendril Screep Open None
Other Operations 119660 Set up LVS for labs dns recursors In-Scope Open None
Other Operations 171191 Should puppet auto-restart slapd? In-Scope Open None
Other Operations 78135 Provide a pxe-bootable rescue image In-Scope Open None
Other Operations 193628 tungsten disk 1 and 8 SMART failure In-Scope Open None
Other Operations 84163 Fix CirrusSearch monitoring In-Scope Open None
Other Operations 152100 should we make privatewiki list available to puppet without maintaining two lists? In-Scope Open None
Other Operations 178575 Add require_package() variant with repository component to wmflib In-Scope Open None
Other Operations 161904 decommission backup4001 In-Scope Open None
Other Operations 175885 Toolforge's static webserver broken by Puppet changes and stale nginx packages In-Scope Open None
Other Operations 131832 Unable to restore file that has a very large file size In-Scope Open None
Other Operations 196507 Degraded RAID on cloudvirt1019 In-Scope Open None
Other Operations 116767 limit the impact of heavy/large graphite queries In-Scope Open None
Other Operations 151045 Extending Yubico 2FA for production use (meta bug) In-Scope Open None
Other Operations 130590 Have dedicated master nodes for elasticsearch In-Scope Open None
Other Operations 198622 migrate maps servers to stretch with the current style Screep Open None
Other Operations 110240 [Discussion] Consider validating JSON schemas when running x-ample tests? In-Scope Open None
Other Operations 142984 Review lists of config/sysctl recommendations by "kernel self-protection project" In-Scope Open None
Other Operations 163823 During labservices1001 failover fqdn changed from foo.project.eqiad.wmflabs to foo.eqiad.wmflabs In-Scope Open None
Other Operations 160229 Back up of Commons files In-Scope Open None
Other Operations 183454 Deprovision Diamond collectors no longer in use In-Scope Open None
Other Operations 123809 Module uwsgi doesn't allow passing multiple config params of same name In-Scope Open None
Other Operations 118677 Nastaleeq font for Western Punjabi In-Scope Open None
Other Operations 93531 secure.wikimedia.org entries still showing up in Google search results In-Scope Open None
Other Operations 91404 Setup backups of elasticsearch indices In-Scope Open None
Other Operations 203485 Revisit Grafana/Icinga notification strategy Screep Open None
Other Operations 204363 Modify elasticsearch_shard_size_check plugin to display only indices and shard size Screep Open None
Other Operations 179562 Create jenkins job for creating deployment artifacts for `docker-pkg-deploy` In-Scope Open None
Other Operations 189741 Build .deb package of python3-aiokafka In-Scope Open None
Other Operations 184634 Netbox: postgres cannot be restarted w/ current config In-Scope Open None
Other Operations 203244 analytics1068 doesn't boot Screep Open None
Other Operations 162123 Running swiftrepl is not puppetized In-Scope Open None
Other Operations 198787 Revisit default settings for c-foreach-restart Screep Open None
Other Operations 166291 Exim panics when spamd reaches maxchildren In-Scope Open None
Other Operations 191956 Document how to fix IPMI issues on Wikitech In-Scope Open None
Other Operations 140075 investigate swift used space spikes since June 2016 In-Scope Open None
Other Operations 181971 Disable hiera autolookups In-Scope Open None
Other Operations 160941 Improve SSH access information in onboarding documentation In-Scope Open None
Other Operations 202033 Feedback Appreciated: Use of HTTP Without TLS Screep Open None
Other Operations 204970 setup/install an-coord1001/wmf7621 Screep Open None
Other Operations 141756 audit / test / upgrade hp smartarray P840 firmware In-Scope Open None
Other Operations 150466 publish kartotherian / tilerator metrics by cluster In-Scope Open None
Other Operations 205507 Decommission analytics100[1,2] Screep Open None
Other Operations 136311 Monitor the BMC's event log for hardware errors In-Scope Open None
Other Operations 193408 SPF record for canonical domains In-Scope Open None
Other Operations 120377 labmon1001 graphite instance archiver keeps archiving the same instances In-Scope Open None
Other Operations 141783 Add monitoring for detecting when logstash services are down In-Scope Open None
Other Operations 202535 decom rigel.frack.codfw.wmnet Elaborated Open None
Other Operations 101141 udp rcvbuferrors and inerrors on graphite1001 In-Scope Open None
Other Operations 92471 enable authenticated access to Cassandra JMX In-Scope Open None
Other Operations 116951 Reprepro should bail if it can't read and sign using the root keys In-Scope Open None
Other Operations 45952 Incorrect "non-identical file already exists" error when undeleting file on Commons In-Scope Open None
Other Operations 205396 Evaluate/integrate rasdaemon as a replacement for mcelog Screep Open None
Other Operations 200209 Decom graphite2001 Screep Open None
Other Operations 196665 rack/setup/install bast2002.wikimedia.org In-Scope Open None
Other Operations 125411 Diamond load averages do not contain scaled versions In-Scope Open None
Other Operations 195421 update physical labels from naos.codfw.wmnet to deploy2001.codfw.wmnet In-Scope Open None
Other Operations 100777 expose hosts in maintenance state so we can prevent scap from running on them In-Scope Open None
Other Operations 199431 Consider the possibility of separating ChangeProp and JobQueue on Kafka level Screep Open None
Other Operations 187984 Update OTRS to the latest stable version (6.x.x) In-Scope Open None
Other Operations 205539 Puppet doesn't restart ircecho when the code changes Screep Open None
Other Operations 199321 Return graphite200[12] to spares pool Screep Open None
Other Operations 199876 Migrate pool counters to stretch Screep Open None
Other Operations 204383 Update Debian Package for Scap to 3.8.6-1 Elaborated Open None
Other Operations 155929 Create /community-beacon alternative entry point In-Scope Open None
Other Operations 197173 Ship MX logs to ELK In-Scope Open None
Other Operations 133164 Document eqiad/codfw transition plan for OCG In-Scope Open None
Other Operations 203777 Successfully switch backend traffic (MediaWiki, Swift, RESTBase, Parsoid and services) to be served from eqiad Screep Open None
Other Operations 179463 Create a single application to provision and manage developer (LDAP) accounts In-Scope Open None
Other Operations 132104 Consider moving policy.wikimedia.org away from WordPress.com In-Scope Open None
Other Operations 160101 Upgrade php5-json .deb to at least 1.3.8 In-Scope Open None
Other Operations 202255 Support for QLogic FastLinQ 41112 Dual Port 10Gb SFP+ Adapter Screep Open None
Other Operations 185024 Readd complete URL parsing fix from 3.18.7 release In-Scope Open None
Other Operations 160644 Eventstreams graphite disk usage In-Scope Open None
Other Operations 111934 Nutcracker stats monitoring should only listen on localhost In-Scope Open None
Other Operations 204830 Temporarily redirect sgs.wikipedia.org to bat-smg.wikipedia.org until bat-smg->sgs move can be done Screep Open None
Other Operations 156475 Investigate spike in 500s during asw-c2-eqiad replacement In-Scope Open None
Other Operations 197624 Improve visibility of incoming operations tasks In-Scope Open None
Other Operations 165511 Change automatic shortlink in blog theme In-Scope Open None
Other Operations 159661 Improve mwmaint servers (e.g. mwmaint1001) userland to process server side uploads In-Scope Open None
Other Operations 158288 Unclean stop of jobrunner service via puppet In-Scope Open None
Other Operations 132324 Tracking and Reducing cron-spam to root@ In-Scope Open None
Other Operations 187078 Re-consider ` >/dev/null 2>&1` as output of many cron'd MW maintenance scripts In-Scope Open None
Other Operations 95742 Decommission amssq31-62 (32 hosts) In-Scope Open None
Other Operations 197003 Dismantle most of the old jobqueue infrastructure Screep Open None
Other Operations 150822 Internal PKI for secure communication - Barcelona Ops offsite 2016 In-Scope Open None
Other Operations 174172 unused grafana-dashboard indices on elasticsearch / logstash In-Scope Open None
Other Operations 198756 Audit log producers across the infrastructure and plan their transition to centralized logging. Screep Open None
Other Operations 167245 prometheus-node-exporter - invalid group: ‘prometheus:prometheus’ In-Scope Open None
Other Operations 128716 Make icinga-wm report Tools homepage check at #wikimedia-labs, too In-Scope Open None
Other Operations 176370 Migrate to PHP 7 in WMF production In-Scope Open None
Other Operations 180853 Bring discourse.mediawiki.org to production In-Scope Open None
Other Operations 201364 rack/setup/install sulfur.wikimedia.org Screep Open None
Other Operations 170108 Operations Q1 goal: Streamlined Service Delivery In-Scope Open None
Other Operations 163673 Some swift disks wrongly mounted on 5 ms-be hosts In-Scope Open None
Other Operations 126574 puppet should try to mount all mountable swift filesystems In-Scope Open None
Other Operations 144539 Remove /srv/deployment/wdqs/wdqs/rules.log symlink In-Scope Open None
Other Operations 183146 Monitor resource usage on a per-cgroup basis In-Scope Open None
Other Operations 148986 Firewall sets not being loaded post-reboot due to a @resolve race on jessie In-Scope Open None
Other Operations 184714 Puppet fails to properly refresh Icinga In-Scope Open None
Other Operations 86546 graphite-web logs are not rotated In-Scope Open None
Other Operations 119679 Rewrite http://download.wikimedia.org/mediawiki/ -> https://releases.wikimedia.org/mediawiki in less than 3 redirects In-Scope Open None
Other Operations 110171 Alert when ES indexes are frozen for more than 30 minutes In-Scope Open None
Other Operations 141897 Review new service 'pre-deployment to production' checklist In-Scope Open None
Other Operations 204567 ms-be2030 spontaneous reboot Screep Open None
Other Operations 152632 Explore hosting the multimedia commons use case In-Scope Open None
Other Operations 141524 eventbus should send statsd in batches In-Scope Open None
Other Operations 86552 Monitor and alarm on SMART attributes In-Scope Open None
Other Operations 184086 Add prometheus exporter to Gerrit In-Scope Open None
Other Operations 191199 Page allocation stalls on scb1001, scb1002 In-Scope Open None
Other Operations 197086 Report problems found by mcelog In-Scope Open None
Other Operations 177195 Reduce technical debt in metrics monitoring In-Scope Open None
Other Operations 129847 conftool-merge should report which node is setting attributes for In-Scope Open None
Other Operations 163339 codfw pdu phase imbalances: audit and correct In-Scope Open None
Other Operations 196886 Replace wtp1043's sda In-Scope Open None
Other Operations 203003 Keyholder phab repo duplicate work Screep Open None
Other Operations 167422 Monitoring: add link to graph for Icinga timeseries alarms In-Scope Open None
Other Operations 46791 [[wikitech:Server_admin_log]] should not rely on freenode irc for logmsgbot entries In-Scope Open None
Other Operations 191315 Cassandra Graphite metrics space usage audit and cleanup In-Scope Open None
Other Operations 193664 Knock down puppet 4 deprecation warnings In-Scope Open None
Other Operations 84845 improve cron spam visibility In-Scope Open None
Other Operations 40860 security@mediawiki.org : Create a public key and publish it on the public key servers In-Scope Open None
Other Operations 156570 Investigate issues with wikitech-static.wikimedia.org In-Scope Open None
Other Operations 144431 RESTBase k-r-v as Cassandra anti-pattern In-Scope Open None
Other Operations 17000 Special:Import error: "Import failed: Could not open import file" In-Scope Open None
Other Operations 156140 Lots of hosts with hyperthreading disabled In-Scope Open None
Other Operations 135385 investigate carbon-c-relay stalls/drops towards graphite2002 In-Scope Open None
Other Operations 196994 Open Phab tasks on SMART failure In-Scope Open None
Other Operations 205543 Give Mathew full root on wdqs servers Screep Open None
Other Operations 194171 rdb2002 correctable memory errors In-Scope Open None
Other Operations 205452 Setup access from service to mysql Screep Open None
Other Operations 197242 Transition citoid to use Zotero's translation-server-v2 In-Scope Open None
Other Operations 189065 Outbound mail from Greenhouse is broken In-Scope Open None
Other Operations 200338 Address mass overload errors in ORES (July 2018, UW origin) Screep Open None
Other Operations 136312 Encrypt syslog traffic In-Scope Open None
Other Operations 187987 Upgrade to Prometheus 2.x In-Scope Open None
Other Operations 183565 Fix regex.yaml single-regex issue In-Scope Open None
Other Operations 190568 Reimage both phab1001 and phab2001 to stretch In-Scope Open None
Other Operations 189629 rename role::xenon In-Scope Open None
Other Operations 146355 Replace etcd internal auth mechanism with a frontend proxy In-Scope Open None
Other Operations 151009 Provide authenticated access to Prometheus native web interface In-Scope Open None
Other Operations 130617 Collect metrics on CirrusSearch usage of PoolCounter In-Scope Open None
Other Operations 129180 Preserve SSH host key when re-imaging hosts In-Scope Open None
Other Operations 94951 Enable the usage of `hhvm -m debug --debug-host ::1` from mw1017 so developers can step through code (think gdb) in production to see what is going wrong. In-Scope Open None
Other Operations 180641 reinstall RT server with private IP and stretch In-Scope Open None
Other Operations 186625 apply hostname labels to bast1002/WMF4749 In-Scope Open None
Other Operations 187754 Figure out why HHVM isn't using error_document404 setting In-Scope Open None
Other Operations 116063 Hardware Automation Workflow - Overall Tracking In-Scope Open None
Other Operations 134237 Graphoid returns a 400 on MW API time-out In-Scope Open None
Other Operations 204362 Resolve elasticsearch shard size alert by doing an in place reindex Screep Open None
Other Operations 116627 Include 5xx numbers in fluorine fatalmonitor In-Scope Open None
Other Operations 201939 rack/setup/install an-master100[12].eqiad.wmnet Screep Open None
Other Operations 155761 DNS repo: add Jenkins job to ensure there are no duplicates In-Scope Open None
Other Operations 134811 Consider REST with SSL (HyperSwitch/Cassandra) for session storage In-Scope Open None
Other Operations 97909 Upgrade jobrunners to redis 2.8 In-Scope Open None
Other Operations 186734 Clean up redundant ORES celery_workers defaults In-Scope Open None
Other Operations 154619 Export ipsec counters as Prometheus metrics In-Scope Open None
Other Operations 187658 Setup cron for foreachwikiindblist all-labs.dblist extensions/AbuseFilter/maintenance/purgeOldLogIPData.php on Beta In-Scope Open None
Other Operations 97524 ocg alarm ocg_job_status_queue 'flapping' In-Scope Open None
Other Operations 165136 Ferm rules for labstore NFS hosts In-Scope Open None
Other Operations 164123 tools-k8s-master-01 has two floating IPs In-Scope Open None
Other Operations 204106 Log slow queries on postgresql / maps Screep Open None
Other Operations 199083 Migrate the hardware inventory from Racktables to Netbox Screep Open None
Other Operations 197819 investigate caching of mailman listinfo pages In-Scope Open None
Other Operations 195252 Update prometheus-varnish-exporter on debian to 1.4 In-Scope Open None
Other Operations 111653 Encrypt all the things In-Scope Open None
Other Operations 200733 Close performance@lists.wikimedia.org in favour of wikitech-l Screep Open None
Other Operations 170298 sshd stretch puppet support In-Scope Open None
Other Operations 176335 logs sent to logstash are lost when the elasticsearch cirrus cluster is unavailable In-Scope Open None
Other Operations 169564 MD RAID: remove mdadm daily check In-Scope Open None
Other Operations 199890 missed pages from kafka outage on July 11 2018 Screep Open None
Other Operations 123276 URL parameters do not work with pages that have "?" in their names In-Scope Open None
Other Operations 142991 Enable "upload by url" feature at zhwiki In-Scope Open None
Other Operations 194186 rack/setup/install cloudelastic100[1-4].eqiad.wmnet systems In-Scope Open None
Other Operations 122917 Provide a good download service of dumps from Wikimedia In-Scope Open None
Other Operations 119274 Check incoming requests to secure.wm.o In-Scope Open None
Other Operations 196477 rack/setup/install backup2001 In-Scope Open None
Other Operations 169322 Research whether it makes sense to have OTRS installation in an HA setup In-Scope Open None
Other Operations 172584 Securing external binaries run by MediaWiki In-Scope Open None
Other Operations 199670 Integrate Stretch 9.5 point release Screep Open None
Other Operations 204974 Request new mail list for Vietnam Wikimedians User Group Screep Open None
Other Operations 137939 Increase frequency of OSM replication In-Scope Open None
Other Operations 94457 Install nodejs, nginx and other dependencies on francium In-Scope Open None
Other Operations 119401 Untangle labs/production roles from labs/instance roles In-Scope Open None
Other Operations 185055 Stack overflow when Redis is down In-Scope Open None
Other Operations 183381 Deploy JADE extension to production In-Scope Open None
Other Operations 198757 Investigate log shipping methods and standardize on them (logstash) Screep Open None
Other Operations 205240 MCE errors on mw2181 / temperature warnings Screep Open None
Other Operations 179022 Backport firejail 0.9.52 for use on Wikimedia appservers In-Scope Open None
Other Operations 158429 Switch to predictable network interface names? In-Scope Open None
Other Operations 200706 rack/setup/install centrallog1001.eqiad.wmnet Screep Open None
Other Operations 178690 Better organization for SRE grafana dashboards In-Scope Open None
Other Operations 190086 Decommission old server wmf4077 In-Scope Open None
Other Operations 197862 Increase the CPU count for proton[12]00[12] In-Scope Open None
Other Operations 162037 Use SSL certificates with discovery entry for elasticsearch In-Scope Open None
Other Operations 141255 Separate host lookup from the sql shell script In-Scope Open None
Other Operations 158434 Phabricator: Make sure phabricator works properly including our puppet roles on jessie In-Scope Open None
Other Operations 156544 Create backups of Wikimedia content in diverse geographic places In-Scope Open None
Other Operations 199780 labstore1003: more SMART failures Screep Open None
Other Operations 202782 upgrade icinga server to stretch and replace einsteinium Screep Open None
Other Operations 171619 ORES should use a git large file plugin for storing serialized binaries In-Scope Open None
Other Operations 202061 Implement an accurate and easy to understand status page for all wikis Screep Open None
Other Operations 179181 Puppet4: hiera() can only be called using the 4.x function API. In-Scope Open None
Other Operations 149421 Long running mediawiki web requests impact service availability, especially databases In-Scope Open None
Other Operations 167091 Elasticsearch errors about BulkShardRequest In-Scope Open None
Other Operations 177197 Export Prometheus-compatible JVM metrics from JVMs in production In-Scope Open None
Other Operations 150917 Remove deprecated features from book creator UI In-Scope Open None
Other Operations 102575 document graphite failover/backfill procedures In-Scope Open None
Other Operations 148048 Store Wikimedia unified account name (SUL) in LDAP directory In-Scope Open None
Other Operations 191659 Configure a threshold for earlier notification of /srv/cassandra/instance-data In-Scope Open None
Other Operations 93138 Procure hardware for Sentry In-Scope Open None
Other Operations 167412 host-vmem.erb is doing operations that make no sense In-Scope Open None
Other Operations 140942 Tracking: Monitoring and alerts for "business" metrics In-Scope Open None
Other Operations 170995 Setup a mirror for R language dependencies (CRAN) In-Scope Open 10.0
Other Operations 104352 Make scap able to depool/repool servers via the conftool API In-Scope Open None
Other Operations 36947 Incorrect text positioning in SVG rasterization (scale/transform; font-size; kerning) In-Scope Open 0.0
Other Operations 163667 Fix UIDs for deployment server users In-Scope Open None
Other Operations 201366 rack/setup/install scandium.eqiad.wmnet (parsoid test box) Screep Open None
Other Operations 125085 Split the API MediaWiki appserver pool into two external/internal pools In-Scope Open None
Other Operations 150300 icinga notification if elevated writing to badpass.log In-Scope Open None
Other Operations 205416 Fix aggregation of "MediaWiki.RevisionSlider.event.load.sum" from average to sum Screep Open None
Other Operations 205364 helium (bacula) - Device not healthy -SMART- Screep Open None
Other Operations 200678 wtp2011 memory correctable errors Screep Open None
Other Operations 64987 librsvg misinterprets quoted font family names that contain whitespace In-Scope Open None
Other Operations 133179 Redis monitoring needs to be improved In-Scope Open None
Other Operations 187434 Include apache_exporter in puppet module apache In-Scope Open None
Other Operations 98831 Honor DNT header for access logs & varnish logs In-Scope Open None
Other Operations 150356 Wikidata Query Service is overly verbose toward logstash In-Scope Open None
Other Operations 114446 move human users out of UID range for system accounts In-Scope Open None
Other Operations 171048 Eventbus does not handle gracefully changes in DNS recursors In-Scope Open None
Other Operations 152782 Kibana functionality missing after upgrade: histograms In-Scope Open None
Other Operations 128715 Add other Tools administrators to the Icinga notification group In-Scope Open None
Other Operations 192610 prometheus on bast3002 misbehaving In-Scope Open None
Other Operations 198220 Stop and remove old job runners In-Scope Open None
Other Operations 201342 rack/setup/install puppetmaster1003.eqiad.wmnet Screep Open None
Other Operations 111595 Do not apply spam headers on email assessed NOT to be spam In-Scope Open None
Other Operations 162122 Swiftrepl was stuck in an infinite loop for days In-Scope Open None
Other Operations 178839 New upstream jvm-tools In-Scope Open None
Other Operations 189921 decom californium In-Scope Open None
Other Operations 148061 Feasibility of hosting podcast setup on Wikimedia servers In-Scope Open None
Other Operations 134326 udpmxircecho should write stats of messages processed and we should alert when that drops to zero In-Scope Open None
Other Operations 161864 404 error while accessing some images files e.g. djvu and jpg In-Scope Open None
Other Operations 205158 Mail relays needed for VMs in eqiad1 Screep Open None
Other Operations 191018 Provide an option menu when booting via PXE In-Scope Open None
Other Operations 205380 Non-working archive for wikimediacz-l list Screep Open None
Other Operations 198209 Graphite returning 500 @ nagf and graphite url In-Scope Open None
Other Operations 185236 Password Vault for Security Team In-Scope Open None
Other Operations 196476 rack/setup/install Prometheus/Grafana host frmon2001 for fr-tech In-Scope Open None
Other Operations 184230 Disavow emails from wikipedia.com In-Scope Open None
Other Operations 196484 rack/setup/install graphite1004 In-Scope Open None
Other Operations 112774 solve mtp panel issue for row uplinks In-Scope Open None
Other Operations 146090 High failure rate of account creation should trigger an alarm / page people In-Scope Open None
Other Operations 118641 Implement proper AAA for lists.wikimedia.org (mailman) Screep Open None
Other Operations 203169 Logstash hardware expansion Screep Open None
Other Operations 204079 move/setup/install frauth2001.frack.codfw.wmnet Screep Open None
Other Operations 137229 Tune thread for osm2pgsql / postgres max connections for Maps In-Scope Open None
Other Operations 139971 access_new_install role vs. Labs vs. the future In-Scope Open None
Other Operations 182274 Create custom per-job metric reporters capability In-Scope Open None
Other Operations 202574 convert cloud VPS projects from apache to httpd module Screep Open None
Other Operations 166937 Broken /a/refinery-source/guard/run_all_guards.sh script on stat1002 In-Scope Open None
Other Operations 191842 Deployment git server can't supply ORES hosts in parallel In-Scope Open None
Other Operations 151050 Proper documentation for Yubico 2FA for production use In-Scope Open None
Other Operations 179354 wikimedia-jessie & wikimedia-stretch docker images don't have deb-src set for apt.wikimedia.org In-Scope Open None
Other Operations 166066 Integrate the puppet compiler in the puppet CI pipeline In-Scope Open None
Other Operations 185815 The Rack Puppet master server is deprecated and will be removed in a future release. Please use Puppet Server instead. In-Scope Open None
Other Operations 165323 Add Prometheus machine metric to track core dumps In-Scope Open None
Other Operations 133093 Investigate idle appservers in codfw In-Scope Open None
Other Operations 170456 FY2017/18 Program 6 - Outcome 2 - Objective 3: Integrated, container-based development environment In-Scope Open None
Other Operations 153940 Logrotate fails for: "$FILE No such file or directory" In-Scope Open None
Other Operations 188602 Decrease the amount of IRC spam in case of widespread puppet failures In-Scope Open None
Other Operations 162612 codfw/eqiad hosts occasionally spend > 3 minutes starting networking.service with linux 4.9 In-Scope Open None
Other Operations 163698 Add flood protection to the ircecho bot (icinga-wm) In-Scope Open None
Other Operations 200362 Send logstash service metrics to prometheus Screep Open None
Other Operations 203402 in Commons, some PDFs are failing to render thumbnails. Screep Open None
Other Operations 156232 confctl SubjectAltNameWarning after python-urllib3 upgrade In-Scope Open None
Other Operations 147366 Setup automated topk wide row reporting In-Scope Open None
Other Operations 95054 Move ircecho config file to be YAML In-Scope Open None
Other Operations 141128 determine/process/document bios firmware tracking/updating policies In-Scope Open None
Other Operations 199251 furud: disconnect and power down all disk shelves Screep Open None
Other Operations 177826 Upgrade ci ssh key to ecdsa In-Scope Open None
Other Operations 182822 Generate a list of files that are supposed to exist but 404s In-Scope Open None
Other Operations 193573 Consider allowing mailing lists to be indexed by archive.org Screep Open None
Other Operations 179353 Scap: Standardize git version In-Scope Open None
Other Operations 153246 Puppet failures with "Attempt to assign to a reserved variable name: 'trusted'" In-Scope Open None
Other Operations 159830 Sanity check global-multiwrite logs for ConfirmEdit usage In-Scope Open None
Other Operations 184063 Remove all decommissioned hardware In-Scope Open None
Other Operations 169518 Decommission esams ms-fe / ms-be In-Scope Open None
Other Operations 140316 Add granularity limiter (g=) to wikimedia.org DKIM record(s) In-Scope Open None
Other Operations 170640 reports.frdev.wm.o -- still in use? In-Scope Open None
Other Operations 164042 Racktables: clearly show when hosts are decommissioned In-Scope Open None
Other Operations 204267 Flood of WDQS requests from wbqc Screep Open None
Other Operations 174916 electron/pdfrender hangs In-Scope Open None
Other Operations 142002 Clean up puppet & configs for ORES In-Scope Open None
Other Operations 124101 Specific revisions of multiple files missing from Swift - 404 Not Found returned In-Scope Open None
Other Operations 193272 Prometheus vs. CPU usage vs. hyperthreading In-Scope Open None
Other Operations 147204 Update confd package In-Scope Open None
Other Operations 117673 labs precise and jessie instance not accessible after provisioning In-Scope Open None
Other Operations 200312 Remove data from Hadoop's HDFS as part of the user offboard workflow Screep Open None
Other Operations 203674 Debian package or files managed by puppet for pt-kill-wmf Screep Open None
Other Operations 203852 rack/setup/install stat1007.eqiad.wmnet (stat1005 user replacement) Screep Open None
Other Operations 124991 evaluate possibility for nscd use with useldap In-Scope Open None
Other Operations 182819 custom fact interface_primary breaks under newer versions of facter In-Scope Open None
Other Operations 67394 [EPIC] Performance testing environment In-Scope Open None
Other Operations 130554 Official support for upgrade from existing Mailman 2.1 lists to Mailman 3 In-Scope Open None
Other Operations 203091 Move Graphoid to Kubernetes via the deployment pipeline Screep Open None
Other Operations 192370 Deploy mcrouter to production as a wancache backend In-Scope Open None
Other Operations 119719 Enforce a minimum refresh period for grafana dashboards hitting graphite In-Scope Open None
Other Operations 201140 Puppetize the installation of PHP-FPM on the MediaWiki hosts Elaborated Open None
Other Operations 150486 Deploy federation for Prometheus In-Scope Open None
Other Operations 199479 Add alerts for Logstash rates in production Screep Open None
Other Operations 175738 Long term storage for frack prometheus data In-Scope Open None
Other Operations 203239 Create Debian packages for Node.js 10 upgrade Screep Open None
Other Operations 180944 Passenger spews Exception NoMethodError in Rack application object In-Scope Open None
Other Operations 126158 [RFC] Alert about *when* partitions will run out of space, not a percentage/absolute number In-Scope Open None
Other Operations 106346 setup an alertable threshold for Cassandra heap dumps In-Scope Open None
Other Operations 170480 FY2017/18 Program 6 - Outcome 2: Developers are able to develop and test their applications through a unified pipeline towards production deployment. In-Scope Open None
Other Operations 114337 Assign 3 more servers to video scaler duty In-Scope Open None
Other Operations 192102 deprecate and remove --autoload in uwsgi puppet class In-Scope Open None
Other Operations 109089 EPIC: Cultivating the Elasticsearch garden (operational lessons from 1.7.1 upgrade) In-Scope Open None
Other Operations 184064 Prepare racks OE14, OE15 and OE16 with new infrastructure In-Scope Open None
Other Operations 199356 Contrabass MIDI instrument is unusable Screep Open None
Other Operations 160158 Make disabled accounts visible in the corp mirror LDAP replica In-Scope Open None
Other Operations 145065 Decrease time required to fully restart the Cirrus elasticsearch clusters In-Scope Open None
Other Operations 191625 Create RC feed for login.wikimedia In-Scope Open None
Other Operations 200984 Stop introducing new code expanded from erb templates Screep Open None
Other Operations 135113 Rationalize our jobqueues redis topology In-Scope Open None
Other Operations 134551 Create functional cluster checks for all services (and have them page!) In-Scope Open None
Other Operations 184065 Setup new access switches In-Scope Open None
Other Operations 32716 Run our own Tor client for Tor block In-Scope Open None
Other Operations 143552 Make elasticsearch configuration more robust to loss of network connectivity In-Scope Open None
Other Operations 187474 Decommission old and unused/spare servers in codfw In-Scope Open None
Other Operations 151048 Icinga monitoring for Yubikey components In-Scope Open None
Other Operations 135124 Deploy etcddump (or another etcd dump & load tool) to production In-Scope Open None
Other Operations 184564 Plan Puppet 5 upgrade In-Scope Open None
Other Operations 199676 Community Relations support for the 2018 data center switchover Screep Open None
Other Operations 127054 pinentry-gtk2 pulls in a lot of unneeded Gnome/GTK libs In-Scope Open None
Other Operations 187445 Decommission osm-db200[12] and osm-web200[1234] In-Scope Open None
Other Operations 181630 Send celery and wsgi service logs to logstash In-Scope Open None
Other Operations 178663 Switch CI Docker Storage Driver to its own partition and to use devicemapper In-Scope Open None
Other Operations 135595 mod_deflate + mod_uwsgi causing mangled apache responses In-Scope Open None
Other Operations 156937 Provide cross-dc redundancy (active-active or active-passive) to all important misc services In-Scope Open None
Other Operations 195392 Use PHP7 for web requests on jobrunner servers In-Scope Open None
Other Operations 187994 netfilter software at WMF: iptables vs nftables In-Scope Open None
Other Operations 184186 Fix unknown variables warning that occur with puppet 4.x In-Scope Open None
Other Operations 200838 v6 ND failure on puppetmaster1001/asw2-b-eqiad Elaborated Open None
Other Operations 148693 Deploy IDS rendering engine to production In-Scope Open None
Other Operations 197564 cronspam for slow queries in PageAssessments In-Scope Open None
Other Operations 182812 Forward security@tools.wmflabs.org to security@wikimedia.org In-Scope Open None
Other Operations 116750 2FA for SSH access to the production cluster In-Scope Open None
Other Operations 205567 PHP Warning "Unable to delete stat cache" from file uploads Screep Open None
Other Operations 125976 Run mediawiki::maintenance scripts in Beta Cluster In-Scope Open None
Other Operations 181559 Investigate redis-cluster or other techniques for making Redis not a single point of failure. In-Scope Open None
Other Operations 201016 Include ADD operation in memcached stats and grafana dashboard Screep Open None
Other Operations 204450 Why doesn't profile::mediawiki::nutcracker create /var/run/nutcracker/ ? Screep Open None
Other Operations 202885 Migrate elasticsearch scripts to spicerack cookbooks Screep Open None
Other Operations 198754 Logstash/Kibana architecture review Screep Open None
Other Operations 194012 labsdb1004 and labsdb1005 some hard disks not healthy Screep Open None
Other Operations 199073 Perform a datacenter switchover (2018-19 Q1) Screep Open None
Other Operations 198479 labvirt1009 HP Raid alert In-Scope Open None
Other Operations 170453 FY2017/18 Program 6: Streamlined Service delivery In-Scope Open None
Other Operations 105780 Create a doc explaining the SLA between services and the monitoring tool In-Scope Open None
Other Operations 159687 etcd switchover/enhancements In-Scope Open None
Other Operations 190766 add ci test for admin module indentation In-Scope Open None
Other Operations 101585 document redis upgrade/restart procedures In-Scope Open None
Other Operations 204135 Warn when CirrusSearch is not configured to use local DC for an extended time Screep Open None
Other Operations 202982 Requests to MW 404 when on HTTPS Screep Open None
Other Operations 146657 create notifications about user accounts that have not been used for a long time In-Scope Open None
Other Operations 202764 Wikidata produces a lot of failed requests for recentchanges API Screep Open None
Other Operations 151049 Run systematic availability tests In-Scope Open None
Other Operations 148647 refresh swift hardware in codfw/eqiad In-Scope Open None
Other Operations 188913 "Obama" page on Beta Cluster often responds with 503 In-Scope Open None
Other Operations 189729 Build .deb package of python3-typing for jessie In-Scope Open None
Other Operations 146968 OTRS spam classification methods and systems In-Scope Open None
Other Operations 137176 catch-all apache vhost on the cluster should return 404 for non-existing sites In-Scope Open None
Other Operations 141566 Change digest function of wikimedia-l@ so it sends emails only once a day Screep Open None
Other Operations 201343 rack/setup/install mwmaint1002.eqiad.wmnet Screep Open None
Other Operations 132325 Weak digest algorithm (SHA1) used to sign InRelease on apt.wikimedia.org In-Scope Open None
Other Operations 199236 Handle SMART for multiple shelves and controllers Screep Open None
Other Operations 160529 Sender email spoofing In-Scope Open None
Other Operations 160412 Add lock_wait_timeout to maintain_views and maintain-meta_p In-Scope Open None
Other Operations 194174 wtp2013 memory correctable errors In-Scope Open None
Other Operations 180051 Reduce the number of fields declared in elasticsearch by logstash In-Scope Open None
Other Operations 106937 Monitor [[Special:ListFiles]] for non 200 HTTP statuses in thumbnails In-Scope Open None
Other Operations 198753 Modernize logging, alerting and metrics monitoring infrastructure - Adopt Logstash (2018-19 Q1 Goal) Screep Open None
Other Operations 150375 cronspam cleanup: Cron <www-data@terbium> /usr/local/bin/foreachwiki maintenance/cleanupUploadStash.php > /dev/null In-Scope Open None
Other Operations 201358 Increase job runners on video scalers to maximize load efficiency Screep Open None
Other Operations 185306 ms-be2023 unresponsive while rebuilding one disk In-Scope Open None
Other Operations 181967 Update puppet code to conform to puppet 4.x and later standards In-Scope Open None
Other Operations 169318 Use multiple puppetdbs on puppet masters In-Scope Open None
Other Operations 171188 Move the main WMCS puppetmaster into the Labs realm In-Scope Open None
Other Operations 133674 HHVM is leaking memory on the API appservers In-Scope Open None
Other Operations 133476 Proposal: Centralize OTRS login methodology In-Scope Open None
Other Operations 156136 Increase swift replication factor for accounts In-Scope Open None
Other Operations 167292 Collate jessie-wikimedia/backports into jessie-wikimedia/main In-Scope Open None
Other Operations 130593 investigate slapd memory leak In-Scope Open None
Other Operations 166038 Sync internal nutcracker package with Debian package In-Scope Open None
Other Operations 187473 Decommission old and unused/spare servers in eqiad In-Scope Open None
Other Operations 174269 Two cases of local-multiwrite storage backend failure In-Scope Open None
Other Operations 118812 Investigate mysterious_sysctl settings and figure out what to do with them In-Scope Open None
Other Operations 172815 Improve stability and maintainability of our browser-based PDF render service In-Scope Open None
Other Operations 198784 Degraded RAID on cp3048 Screep Open None
Other Operations 149589 Puppet tab in Horizon unusably slow In-Scope Open None
Other Operations 169287 etcd config depends on puppet certs, but puppet doesn't know In-Scope Open None
Other Operations 192457 Reallocate former image scalers In-Scope Open None
Other Operations 202898 Decommission maps-test cluster Screep Open None
Other Operations 149543 Setup PAWS internal experimentally on notebook* nodes In-Scope Open None
Other Operations 203645 rspec-puppet fails with Could not find the daemon directory (tested [/etc/sv,/var/lib/service]) Screep Open None
Other Operations 176437 puppet ca_server confusion In-Scope Open None
Other Operations 184236 Puppet broken on deployment-ms-be0[34] with evaluation error in swift module In-Scope Open None
Other Operations 135125 Install a second etcd cluster in codfw In-Scope Open None
Other Operations 198215 systemd-logind fails with result 'timeout' in db2093 and dns4001 In-Scope Open None
Other Operations 167689 Add RIPE atlas data to Prometheus In-Scope Open None
Other Operations 163288 Decide on /var/lib vs /home as locations of homedir for l10nupdate In-Scope Open None
Other Operations 151046 Fully puppetise yubikey-val In-Scope Open None
Other Operations 205522 ircecho / icinga-wm crashlooping Screep Open None
Other Operations 205712 wtp2020: correctable memory errors Screep Open None
Other Operations 132632 puppetize turning off reserved space for cassandra /srv In-Scope Open None
Other Operations 164490 maintain-meta_p hangs on connecting to wikimedia.org.uk In-Scope Open None
Other Operations 204801 Exec error "Possibly missing executable file: svn diff" from Special:Code Screep Open None
Other Operations 169286 labstore1005 A PCIe link training failure error on boot In-Scope Open None
Other Operations 135122 Reduce etcd technical debt In-Scope Open None
Other Operations 198041 graphite2001 crashed In-Scope Open None
Other Operations 131326 smokeping config puppetization issue? In-Scope Open None
Other Operations 177371 Phase out DSA keys for SSH access (ssh-dss) In-Scope Open None
Other Operations 168403 Aggregate prometheus functions yielding different results in grafana vs. prometheus console In-Scope Open None
Other Operations 87220 Minimize differences between beta and production (Tracking) In-Scope Open None
Other Operations 111540 Clean up labs graphite datapoints In-Scope Open None
Other Operations 187673 Build and deploy hhvm-luasandbox 3.0.1 to Wikimedia wikis In-Scope Open None
Other Operations 158757 Puppet certificate missing subjectAltName In-Scope Open None
Other Operations 158562 Manage apt sources via puppet? In-Scope Open None
Other Operations 175362 Split MXes into inbound and outbound In-Scope Open None
Other Operations 136562 Audit/fix hosts with no RAID configured In-Scope Open None
Other Operations 124185 Evaluate alternative web interfaces to icinga 1 core In-Scope Open None
Other Operations 192532 Figure out a way to enable volunteers to use the puppet compiler In-Scope Open None
Other Operations 186748 New service request: chromium-render/deploy In-Scope Open 5.0
Other Operations 166368 Wipe of spare/replacement disks In-Scope Open None
Other Operations 82937 re-create script for manual paging In-Scope Open None
Other Operations 195847 Clean up artifacts from LaTeX based math rendering In-Scope Open None
Other Operations 118829 Automate the provisioning and management of MediaWiki clusters In-Scope Open None
Other Operations 188317 Detect high server load earlier – prometheus alert? In-Scope Open None
Other Operations 138496 bring swift eqiad to one zone per row In-Scope Open None
Other Operations 199220 Cleanup cirrus keys in $wmfSwiftEqiadConfig Screep Open None
Other Operations 204364 Rate limit wdqs logs Screep Open None
Other Operations 142815 Enhance account handling (meta bug) In-Scope Open None
Other Operations 198901 Migrate production services to kubernetes using the pipeline Screep Open None
Other Operations 179230 Puppet wmf-style-guide: array of classes not detected properly In-Scope Open None
Other Operations 125015 Requests to (hard) redirect pages return their target's contents but are counted as pageviews to the redirect page In-Scope Open None
Other Operations 161296 Upgrade mysqld_exporter to 0.10.0 In-Scope Open None
Other Operations 180105 Set up a statsv-like endpoint for Prometheus In-Scope Open None
Other Operations 196697 rack/setup/add to spares tracking 2 single cpu misc class systems In-Scope Open None
Other Operations 174475 update firmware on scs consoles In-Scope Open None
Other Operations 120856 Remove all out of warranty unused cp10xx's from A2 In-Scope Open None
Other Operations 133844 Improve Elasticsearch icinga alerting In-Scope Open None
Other Operations 204032 Support meta tag refresh redirects in citoid to support elsevier's linking hub Elaborated Open None
Other Operations 179099 puppetmaster hostcert and hostprivkey point to nonexistent files In-Scope Open None
Other Operations 151314 logrotate failing with $FILE.1.gz: File exists In-Scope Open None
Other Operations 197554 Update wikitech-static mediawiki version In-Scope Open None
Other Operations 158022 make apt.wikimedia.org HA In-Scope Open None
Other Operations 107108 Flow notification links on mobile point to desktop In-Scope Open None
Other Operations 195364 Remove pear/mail packages from WMF MW app servers In-Scope Open None
Other Operations 133913 Completely port l10nupdate to scap In-Scope Open None
Other Operations 127797 document all puppet classes / defined types!? In-Scope Open None
Other Operations 190716 Deploying FileExporter and FileImporter In-Scope Open None
Other Operations 95053 ircecho should accept input via unix sockets In-Scope Open None
Other Operations 179192 Check analytics1037 power supply status In-Scope Open None
Other Operations 191627 Remove Cassandra 2.2.6 packages from jessie-wikimedia/thirdparty apt repo In-Scope Open None
Other Operations 135318 Document how to handle 'inconsistent state within the internal storage backends' issues In-Scope Open None
Other Operations 202705 Degraded RAID on sodium Screep Open None
Other Operations 167966 Look into feasibility of disabling sha-1 host keys on our ssh daemons In-Scope Open None
Other Operations 199406 rsyslog's in:imtcp thread stuck on old sockets Screep Open None
Other Operations 134271 Replace ircd-ratbox with something newer/maintained In-Scope Open None
Other Operations 116747 Meta task "Revamp user authentication" In-Scope Open None
Other Operations 87790 decom amslvs1-4 (dc work) In-Scope Open None
Other Operations 193473 Add HTTPS support to wdqs-internal service In-Scope Open None
Other Operations 165885 Create a cron to clean clientbucket every day or hour In-Scope Open None
Other Operations 203861 decom radium Elaborated Open None
Other Operations 128590 Cassandra uses default ip address for outbound packets while bootstrapping In-Scope Open None
Other Operations 89808 wikitech instances list is blank In-Scope Open None
Other Operations 187651 Setting packages on 'hold' breaks puppet runs In-Scope Open None
Other Operations 182228 run-no-puppet leaves puppet disabled on kill/crash In-Scope Open None
Other Operations 181632 Celery manager implodes horribly if Redis goes down In-Scope Open None
Other Operations 130709 authoritative copy of 'root' files for upload.wikimedia.org is only in swift In-Scope Open None
Other Operations 111838 Some files had disappeared from Commons after renaming In-Scope Open None
Other Operations 200202 WDQS disk usage increase is correlated with reloading of categories Screep Open None
Other Operations 185814 /etc/puppet/hiera.yaml: Use of 'hiera.yaml' version 3 is deprecated. It should be converted to version 5 In-Scope Open None
Other Operations 187076 Deploy error: insufficient permission for adding an object to repository database .git/objects In-Scope Open None
Other Operations 94329 secure Cassandra/RESTBase cluster In-Scope Open None
Other Operations 193160 Monitor the BIOS boot order and parameters In-Scope Open None
Other Operations 180023 [DRAFT][RfC] Deployment of python applications in production In-Scope Open None
Other Operations 140442 reinstall rdb100[56] with RAID In-Scope Open None
Other Operations 182249 Diagnose and fix 4.5k req/min ceiling for ores* requests In-Scope Open None
Other Operations 125442 es2009 degraded RAID In-Scope Open None
Other Operations 196685 rack/setup/install rdb10[09|10].eqiad.wmnet In-Scope Open None
Other Operations 202136 Onboarding Cole White Screep Open None
Other Operations 196019 setup/install phab1002(WMF4727) In-Scope Open None
Other Operations 148614 Icinga check for Tor In-Scope Open None
Other Operations 205542 Add cumin aliases for each wdqs clusters Screep Open None
Other Operations 67270 Default license for operations/puppet In-Scope Open None
Other Operations 179856 Improve documentation for mirrors.wikimedia.org In-Scope Open None
Other Operations 149057 Designate seems very slow to delete records? In-Scope Open None
Other Operations 85451 scale graphite deployment (tracking) In-Scope Open None
Other Operations 141704 Unable to delete certain files due to "inconsistent state within the internal storage backends" In-Scope Open None
Other Operations 188098 Add Prometheus collector for Tor In-Scope Open None
Other Operations 170481 FY2017/18 Program 6 - Outcome 2 - Objective 2: Set up a continuous integration and deployment pipeline In-Scope Open None
Other Operations 199008 sql enwik gives a poor error message when db doesn't exist Screep Open None
Other Operations 121240 Network isolation for production and semi-production services In-Scope Open None
Other Operations 197172 Improve outbound mail service alerting In-Scope Open None
Other Operations 203736 Set up mailing list for Bengali Wikibooks Screep Open None
Other Operations 128615 Get rid of Tool Labs home page check from shinken In-Scope Open None
Other Operations 82350 update exim::listserve::private::mailing_lists value in puppet In-Scope Open None
Other Operations 114801 operations-apache-config-lint replacement doesn't check syntax In-Scope Open None
Other Operations 198648 Authentication for grafana Elaborated Open None
Other Operations 131966 Default gateway unreachable on baham.wikimedia.org after reboot In-Scope Open None
Other Operations 202329 SRE query: Is it possible to measure how many e-mails are sent to "black hole" e-mail addresses? Screep Open None
Other Operations 171122 librenms: consider using Distributed Poller with multiple netmon servers In-Scope Open None
Other Operations 162029 Migrate all jessie hosts to Linux 4.9 In-Scope Open None
Other Operations 203520 decommission thulium.frack.eqiad.wmnet Screep Open None
Other Operations 149287 Heating alerts for mw servers in eqiad In-Scope Open None
Other Operations 191357 decom silver/WMF3434 In-Scope Open None
Other Operations 200832 remove mathoid from scb Screep Open None
Other Operations 185195 tmpreaper doesn't play along with PrivateTmp systemd units In-Scope Open None
Other Operations 157972 Puppet fails only once when restarting ferm is not successful In-Scope Open None
Other Operations 184561 Modernize Puppet Configuration Management (2017-18 Q3 Goal) In-Scope Open None
Other Operations 197171 Graph outbound mail volume on per-service or hostgroup level In-Scope Open None
Other Operations 171157 Monitor internal CA expirations In-Scope Open None
Other Operations 196478 rack/setup/install backup1001 In-Scope Open None
Other Operations 198138 Disable agent forwarding to important hosts In-Scope Open None
Other Operations 166233 Update redis puppet class to support stretch In-Scope Open None
Other Operations 150875 Confirm attribution needs In-Scope Open None
Other Operations 156398 Decommission or repair old asw-c2-eqiad In-Scope Open None
Other Operations 170474 Decommission and store old row D network gear. In-Scope Open None
Other Operations 162955 rebuild tools-grid-master as a large instance In-Scope Open None
Other Operations 182597 Use EtcdConfig in production to allow automation of a datacenter switch In-Scope Open None
Other Operations 187101 Setup some alert mechanism when some 'critical' cron jobs fail In-Scope Open None
Other Operations 158915 Setup reply emails for gerrit In-Scope Open None
Other Operations 187991 Have swift metrics available in Prometheus In-Scope Open None
Other Operations 161096 confctl no longer logs a non-changing state change In-Scope Open None
Other Operations 178445 flapping monitoring for recommendation_api on scb In-Scope Open None
Other Operations 161528 incident 20170323-wikibase did not trigger Icinga paging In-Scope Open None
Other Operations 201779 Have a check to prevent non-existent accounts from being added to LDAP groups Screep Open None
Other Operations 203272 cp3038, cp3039 - power supply redundancy failure Screep Open None
Other Operations 204361 Raise alert level on disk space for old elasticsearch servers Screep Open None
Other Operations 118746 Goal: Strengthen Incident monitoring infrastructure In-Scope Open None
Other Operations 179696 Homepage for https://docker-registry.wikimedia.org In-Scope Open None
Other Operations 191191 pdfrender logs to /var/log/syslog as well as to /srv/log/pdfrender In-Scope Open None
Other Operations 116805 DomainKeys Identified Mail (DKIM) for phabricator.wikimedia.org In-Scope Open None
Other Operations 163507 Intermittent DB connectivity problem on phabricator, needs investigation In-Scope Open None
Other Operations 168460 Update certificates on production replicas of corp.wikimedia.org LDAP In-Scope Open None
Other Operations 114849 Log lines on fluorine overflow at 8092 bytes. In-Scope Open None
Other Operations 167376 Decommission cp300[3456] In-Scope Open None
Other Operations 195981 require_package should mark packages as manually installed In-Scope Open None
Other Operations 184924 Utilize the deployment pipeline (stretch) In-Scope Open None
Other Operations 202504 Evaluate VMware's Harbor as a docker registry Screep Open None
Other Operations 179501 Use external dsh group to list pooled ORES nodes In-Scope Open None
Other Operations 46016 SVG fails to render properly due to several issues In-Scope Open None
Other Operations 203260 Outdated TLS config for MXes Screep Open None
Other Operations 153279 labnet/ labtestnet2001 - disk space - nova-api.log needs rotation In-Scope Open None
Other Operations 106664 Set up role accounts and feedback loops (FBL) with all providers In-Scope Open None
Other Operations 182702 Debian Jessie reimage/install ends up in kernel panic with 8.10 netboot image In-Scope Open None