TechnicalOperations Status Report

Project Operations from 2018-04-01 to 2018-05-23

Help

Network Operations 193677 Interface errors on asw-d-codfw:xe-2/0/47 Screep Done None
Network Operations 192087 Rename lvs* LLDP port descriptions after upgrading to stretch Screep Done None
Network Operations 192125 asw1-eqsin vcp port flapping Screep Done None
Network Operations 192104 Asymetric routing for cross-DC pfw syslog-tls Screep Done None
Network Operations 191996 db1114 connection issues Screep Done None
Network Operations 193897 cr1-eqsin 4 onboard interfaces down Screep Done None
Network Operations 192114 ulsfo<->eqord BGP down Screep Done None
Network Operations 193189 New PFW policy Screep Done None
Network Operations 194858 asw-c8 rebooting Screep Done None
Network Operations 190323 Implement BGP graceful shutdown In-Scope Done None
Network Operations 176337 esams: networking audit for support contract renewal In-Scope Done None
Network Operations 157435 Review ACLs for the Analytics VLAN In-Scope Done None
Network Operations 190317 Update BGP_sanitize_in filter In-Scope Done None
Network Operations 193177 NAT for new fundraising bastion Screep Done None
Network Operations 191667 Juniper HA audit Screep Open None
Network Operations 184067 Complete router migration from cr1-esams to cr3-esams In-Scope Open None
Network Operations 183390 unrack/decom pfw1-eqiad and pfw2-eqiad In-Scope Open None
Network Operations 189689 Connection timeout from 195.77.175.64/29 to text-lb.esams.wikimedia.org In-Scope Open None
Network Operations 167691 High amount of unexpected ICMP dest unreachable toward esams cache clusters In-Scope Open None
Network Operations 83992 Juniper monitoring In-Scope Open None
Network Operations 122210 Security audit for tftp on install1001 In-Scope Open None
Network Operations 190184 Netbox: setup backups In-Scope Open None
Network Operations 167842 Find a new PIM RP IP In-Scope Open None
Network Operations 191371 Enabling graceful-switchover causes core dumps on cr1-codfw Screep Open None
Network Operations 172459 eqiad row D switch upgrade In-Scope Open None
Network Operations 170144 Evaluate NetBox as a Racktables replacement & IPAM In-Scope Open None
Network Operations 82038 create a test for multicast relay In-Scope Open None
Network Operations 181036 Pull netflow data in realtime from Kafka via Tranquillity/Spark In-Scope Open None
Network Operations 194515 eqdfw: Patch GTT cross-connect Screep Open None
Network Operations 185151 replace msw1-esams In-Scope Open None
Network Operations 189522 Detect IP address collisions In-Scope Open None
Network Operations 174596 dmz_cidr only includes some wikimedia public IP ranges, leading to some very strange behaviour In-Scope Open None
Network Operations 187962 Rack/cable/configure asw2-c-eqiad switch stack In-Scope Open None
Network Operations 186021 reconfigure esams switch port for new bastion In-Scope Open None
Network Operations 187820 Rack/cable/configure mr1-eqiad In-Scope Open None
Network Operations 133387 Enabling IGMP snooping on QFX switches breaks IPv6 (HTCP purges flood across codfw) In-Scope Open None
Network Operations 163674 Frequent RST returned by appservers to LVS hosts In-Scope Open None
Network Operations 187929 Cloud IPv6 subnets In-Scope Open None
Network Operations 98006 Anycast (Auth)DNS In-Scope Open None
Network Operations 187960 Rack/cable/configure asw2-a-eqiad switch stack In-Scope Open None
Network Operations 122406 Consider renumbering Labs to separate address spaces In-Scope Open None
Network Operations 189552 Rack/cable/configure ulsfo MX204 In-Scope Open None
Network Operations 193496 Allocate public v4 IPs for Neutron setup in eqiad Screep Open None
Network Operations 136671 Intermittent bandwidth issue to labs proxy (eqiad) from Comcast in Portland OR In-Scope Open None
Network Operations 186550 Anycast recdns In-Scope Open None
Network Operations 185171 replace mr1-eqiad In-Scope Open None
Network Operations 185926 review and fix scs config In-Scope Open None
Network Operations 190090 Offload pings to dedicated server In-Scope Open None
Network Operations 174616 set up cr3-esams In-Scope Open None
Network Operations 124843 Peer with SFMIX at ULSFO in 200 Paul In-Scope Open None
Network Operations 171032 Investigate lvs IP pages during codfw row C switch upgrade In-Scope Open None
Network Operations 120425 dumps.wikimedia.org seems to have poor throughput towards some destinations In-Scope Open None
Network Operations 180179 Evaluate the possibility to add Juniper images to Openstack In-Scope Open None
Network Operations 174637 Setup esams atlas anchor In-Scope Open None
Network Operations 106056 set up a looking glass for WMF ASes In-Scope Open None
Network Operations 190364 eqiad 10G ports needs In-Scope Open None
Network Operations 86541 setup wifi in codfw In-Scope Open None
Network Operations 167306 ospf link-protection In-Scope Open None
Network Operations 173698 Backfill librenms data in graphite with historical RRDs In-Scope Open None
Network Operations 167841 Cleanup confed BGP peerings and policies In-Scope Open None
Network Operations 150264 Icinga check for VRRP In-Scope Open None
Network Operations 189519 Audit switch ports/descriptions/enable In-Scope Open None
Network Operations 190424 modify labs-hosts1-vlans for http load of installer kernel In-Scope Open None
Network Operations 185337 rack spare switches in c1-eqiad In-Scope Open None
Traffic 193489 Refactor varnishospital and varnishslowlog Screep Done None
Traffic 184068 Procure and install LVS and miscellaneous servers In-Scope Done None
Traffic 156026 Enable Service in Asia Cache DC In-Scope Done None
Traffic 178151 Add UDP monitor for pybal In-Scope Done None
Traffic 191229 cp2022 memory replacement Screep Done None
Traffic 191225 cp2010 memory replacement Screep Done None
Traffic 121561 Encrypt Kafka traffic, and restrict access via ACLs In-Scope Done 0.0
Traffic 164376 [Discuss] Split ORES scores in datacenters based on wiki In-Scope Done None
Traffic 191223 cp2006 memory replacement Screep Done None
Traffic 180792 Remove 3DES patch from OpenSSL builds In-Scope Done None
Traffic 191226 cp2011 memory replacement Screep Done None
Traffic 191028 Thumbor incorrectly normalizes .jpe and .jpeg into .jpg for Swift thumbnail storage In-Scope Done None
Traffic 191905 eqsin hosts don't allow remote ipmi Screep Done None
Traffic 189252 Define turn-up process and scope for eqsin service to regional countries In-Scope Done None
Traffic 187014 Proxies information gone from Zero portal. Opera mini pageviews geolocating to wrong country Screep Done None
Traffic 191227 cp2017 memory replacement Screep Done None
Traffic 191224 cp2008 memory replacement Screep Done None
Traffic 193376 Gather 24h data cluster wide of AES128-SHA usage Screep Done None
Traffic 191228 cp2018 memory replacement Screep Done None
Traffic 191897 Reimage LVS servers as stretch Elaborated Done None
Traffic 190607 cp3048 hardware issues In-Scope Open None
Traffic 134807 Replace test hostnames in datecenter-specific subdomains with dashed names In-Scope Open None
Traffic 194814 Remove unnecessary response headers Screep Open None
Traffic 167906 Make API usage limits easier to understand, implement, and more adaptive to varying request costs / concurrency limiting In-Scope Open None
Traffic 23027 Requests with utf-8 in the URL return a outdated page revision In-Scope Open None
Traffic 137252 Redirect phabricator.mediawiki.org to phabricator.wikimedia.org In-Scope Open None
Traffic 174342 Missing IP addresses for Maroc Telecom In-Scope Open None
Traffic 96499 dbtree loads third party resources (from jquery.com and google.com) In-Scope Open None
Traffic 148976 Strongswan Icinga check: do not report issues about depooled hosts In-Scope Open None
Traffic 172148 Determine URL paths for Zim files In-Scope Open None
Traffic 150673 Thumb API: Varnish / CDN questions In-Scope Open None
Traffic 192368 Unconditional return(deliver) in vcl_hit Screep Open None
Traffic 133717 Letsencrypt all the prod things we can - planning In-Scope Open None
Traffic 99531 [Task] move wikiba.se webhosting to wikimedia misc-cluster In-Scope Open None
Traffic 174932 Recurrent 'mailbox lag' critical alerts and 500s In-Scope Open None
Traffic 180269 Wikimedia's recent upgrade to nginx v. 1.13.6 breaks older Android HTTP libraries In-Scope Open None
Traffic 154801 Investigate varnishd child crashes when multiple nodes get depooled/pooled concurrently In-Scope Open None
Traffic 171168 cp1050 apparently stuck while "Initializing firmware interfaces..." In-Scope Open None
Traffic 165560 Artificial spike in offset of unique devices from November to February 6th on wikidata In-Scope Open None
Traffic 177927 Refactor kafka_config.rb and and kafka_cluster_name.rb in puppet to avoid explicit hiera calls In-Scope Open None
Traffic 120509 Cache education dashboard pages In-Scope Open None
Traffic 191017 Unwanted service startups and their triggers In-Scope Open None
Traffic 178173 Renew unified certificates 2017 In-Scope Open None
Traffic 134323 confctl: give regexen more freedom In-Scope Open None
Traffic 165765 Refactor pybal/LVS config for shared failover In-Scope Open None
Traffic 180434 Uncacheable content handling: hfp vs hfm In-Scope Open None
Traffic 167239 Redirect status.wikipedia.org to status.wikimedia.org In-Scope Open None
Traffic 101002 Use Upgrade Insecure Requests on Wikimedia wikis In-Scope Open None
Traffic 179025 LVS hosts should have static-mapped IPv6 on all virtual interfaces In-Scope Open None
Traffic 167513 Redirect lzh.wikipedia to zh-classical.wikipedia In-Scope Open None
Traffic 130904 Host rewrite for /static/ not applied to purges In-Scope Open None
Traffic 184534 Cached page previews not shown when refreshed In-Scope Open None
Traffic 156462 Framework to transfer files over the LAN Screep Open None
Traffic 127573 wikiknihy.cz - transfer to Wikimedia Czech Republic? In-Scope Open None
Traffic 146332 Create short link for outreachdashboard.wmflabs.org In-Scope Open None
Traffic 141480 mixed-content issues on planet.wikimedia.org In-Scope Open None
Traffic 117826 TEST: redirect small portion of unauthenticated desktop users to mobile web In-Scope Open None
Traffic 180257 Puppet / LVS: confusion in service vs IP name In-Scope Open None
Traffic 111588 RFC: API-driven web front-end In-Scope Open None
Traffic 147648 Unexplained increase in thumbnail 500s In-Scope Open None
Traffic 101525 Set up LVS for current AuthDNS In-Scope Open None
Traffic 153468 Ferm/DNS library weirdness causing puppet errors on some deployment-prep instances In-Scope Open None
Traffic 137979 Support brotli compression In-Scope Open None
Traffic 120121 Improve Varnish XFF processing for trusted proxies In-Scope Open None
Traffic 188561 SSL cert for links.email.wikimedia.org In-Scope Open None
Traffic 179953 cp3043 disk failure In-Scope Open None
Traffic 152622 Wikipedia.cz and other domains owned by WMCZ have invalid certificate In-Scope Open None
Traffic 161360 404 loading images from Virgin Media In-Scope Open None
Traffic 182993 TLS security review of the Kafka stack In-Scope Open 13.0
Traffic 192082 lvs2006 Embedded Flash/SD-CARD iLO errors Screep Open None
Traffic 147202 Removing support for AES128-SHA TLS cipher In-Scope Open None
Traffic 172123 Determine how to upload Zim files to Swift infrastructure In-Scope Open None
Traffic 188087 Some etcd connections not established at startup In-Scope Open None
Traffic 193445 Update Media dashboard in Grafana to use Prometheus metrics Elaborated Open None
Traffic 164327 replace ulsfo aging servers In-Scope Open None
Traffic 133548 Create a secure redirect service for large count of non-canonical / junk domains In-Scope Open None
Traffic 164768 Explicitly limit varnishd transient storage In-Scope Open None
Traffic 86915 nan and minnan subdomain redirects are a mess In-Scope Open None
Traffic 159412 Convert all of our site.pp/roles to the role/profile paradigm In-Scope Open None
Traffic 171850 Backport ipvsadm In-Scope Open None
Traffic 173966 Like nan.wikipedia.org, redirect other nan.*.org to the proper zh-min-nan.*.org domains In-Scope Open None
Traffic 161256 multi-component wmflabs.org subdomains doesn't work under simple wildcard TLS cert In-Scope Open None
Traffic 131894 Collect Backend-Timing in Graphite (or Prometheus) In-Scope Open None
Traffic 120631 Security: Is it safe to enable Zero spoofing In-Scope Open None
Traffic 138546 Backend naming in VCL needs to use fqdn+port In-Scope Open None
Traffic 83467 LVS testing needs to include internal services testing In-Scope Open None
Traffic 128358 Uploading 1.2GB ogv results in 503 In-Scope Open None
Traffic 165252 cp1053 possible hardware issues In-Scope Open None
Traffic 164460 Use DNS discovery record for deployment CNAME In-Scope Open None
Traffic 158604 Investigate usefulness of SameSite cookies for logged-in accounts In-Scope Open None
Traffic 176366 Decom cp4005-8,13-16 (8 nodes) In-Scope Open None
Traffic 122867 Evaluate the feasibility of cache invalidation for the action API In-Scope Open None
Traffic 187157 cp5006 unresponsive In-Scope Open None
Traffic 91820 Create HTTP verb and sticky cookie DC routing in VCL In-Scope Open None
Traffic 137747 Parametrization of VCL is inconsistent In-Scope Open None
Traffic 180329 Add CI to all operations/software/varnish/* repositories and archive obsolete ones In-Scope Open None
Traffic 146619 DNS domains registered to WMF no longer redirecting In-Scope Open None
Traffic 152091 Block hotlinking In-Scope Open None
Traffic 138591 Backport iproute2 4.x from debian testing -> our jessie In-Scope Open None
Traffic 184293 rack/setup/install lvs101[3-6] In-Scope Open None
Traffic 132629 Data passed to HHVM ($_SERVER variables) is a mixed bag of already-decoded and non-decoded nonsense In-Scope Open None
Traffic 183177 memory errors not showing in icinga In-Scope Open None
Traffic 141266 letsencrypt puppetization: add parallel rsa+ecdsa cert support In-Scope Open None
Traffic 144508 Point wikipedia.in to 205.147.101.160 instead of URL forward In-Scope Open None
Traffic 109331 Deleted files sometimes remain visible to non-privileged users if permanently linked In-Scope Open None
Traffic 193521 Consider adding expect-CT: header to enforce certificate transparency Screep Open None
Traffic 188804 Investigate and fix odd uri_host values In-Scope Open None
Traffic 154702 Fix broken referer categorization for visits from Safari browsers In-Scope Open None
Traffic 170567 Support TLSv1.3 In-Scope Open None
Traffic 125226 [feature request] Redirect root API path to docs page In-Scope Open None
Traffic 97051 adding new languages to DNS langs.tmpl doesn't work until zone template is edited as well In-Scope Open None
Traffic 178815 decom cp40(09|1[078]) In-Scope Open None
Traffic 188807 Update documentation for "https" field in X-Analytics In-Scope Open None
Traffic 194380 Identify bots using AES128-SHA maintainers running on toolforge Screep Open None
Traffic 154017 compile number of http uses for http://www.wikidata.org/entity In-Scope Open None
Traffic 114104 pybal doesn't fully manage LVS table leaving stale services (on IP change) In-Scope Open None
Traffic 146832 Clarify caching to enable direct Wikidata Query Service access by <mapframe/link> In-Scope Open None
Traffic 147967 The WMF-Last-Access Set-Cookie header should follow RFC 2965 syntax rather than the pre-RFC Netscape format In-Scope Open None
Traffic 138093 Investigate query parameter normalization for MW/services In-Scope Open None
Traffic 178592 decommission/replace bast4001.wikimedia.org In-Scope Open None
Traffic 99216 Please set up a CNAME for videoserver.wikimedia.org to Video Editing Server In-Scope Open None
Traffic 172103 IPVS issues with UDP services, pybal depooling strategy In-Scope Open None
Traffic 120486 add a https-only option to dynamicproxy In-Scope Open None
Traffic 191393 Puppet: tlsproxy localssl default_server make a Notify at each run Screep Open None
Traffic 190244 en-wp.org certificate error In-Scope Open None
Traffic 119038 Image cache issue when 'over-writing' an image on commons In-Scope Open None
Traffic 161517 Allow anonymous users to change interface language on Commons with ULS In-Scope Open None
Traffic 157786 Unhandled error stopping pybal: 'RunCommandMonitoringProtocol' object has no attribute 'checkCall' In-Scope Open None
Traffic 123854 Set up action API latency / error rate metrics & alerts In-Scope Open None
Traffic 180921 Referrer policy for browsers which only support the old spec In-Scope Open None
Traffic 133001 Decom legacy ex-parsoidcache cxserver, citoid, and restbase service hostnames In-Scope Open 0.0
Traffic 82849 lvs servers report 'Memory allocation problem' on bootup In-Scope Open None
Traffic 150022 thumb.php should not set CC:no-cache on renderer 404 responses? In-Scope Open None
Traffic 117435 Spike: CentralNotice: Verify that our Special:HideBanners cookie storm works as efficiently as possible In-Scope Open 2.0
Traffic 129682 Look into solutions for replaying traffic to testing environment(s) In-Scope Open None
Traffic 180712 VCL: handling of uncacheable responses in wikimedia-common In-Scope Open None
Traffic 119372 Pybal IdleConnectionMonitor with TCP KeepAlive shows random fails if more than 100 servers are involved. In-Scope Open None
Traffic 162818 icinga alerts on nodejs services when a recdns server is depooled In-Scope Open None
Traffic 112316 Configure varnish to use "Unconfigured domain" page for 404 Not Served (instead of generic error) In-Scope Open None
Traffic 134447 letsencrypt puppetization: upgrade for scalability In-Scope Open None
Traffic 45250 Redo /beacon/impression system (formerly Special:RecordImpression) to remove extra round trips on all FR impressions (title was: S:RI should pyroperish) In-Scope Open None
Traffic 181368 Log source port for anonymous users and expose it for sysops/checkusers In-Scope Open None
Traffic 94125 Central login notice appears on unencrypted API format=*fm pages, where reloading does not affect login status In-Scope Open None
Traffic 191940 Investigate 2018-04-10 global traffic drop Screep Open None
Traffic 134324 confctl select needs a -y flag? In-Scope Open None
Traffic 163541 cache hosts should auto-repool iff OCSP files are sane In-Scope Open None
Traffic 174960 Varnish does not vary elasticsearch query by request body In-Scope Open None
Traffic 179026 LVS IPv6 IPs should all be recorded in DNS In-Scope Open None
Traffic 75944 Monitor Varnish caches on beta cluster have two varnishd process running In-Scope Open None
Traffic 74186 Varnish: Mobile site redirect interferes with OAuth authorization process In-Scope Open None
Traffic 186732 Decide on Cache-Control headers for map tiles In-Scope Open None
Traffic 79730 Add pybal check to ensure service IP is bound In-Scope Open None
Traffic 136703 Add LVS public endpoint checks that bypass caches In-Scope Open None
Traffic 179197 Investigate what caused the the unattended varnish upgrade in Beta Cluster In-Scope Open None
Traffic 140365 Lower geodns TTLs from 600 (10min) to 300 (5min) In-Scope Open None
Traffic 98165 Figure out an etcd deploy strategy that includes multi DC failure scenarios. In-Scope Open None
Traffic 119366 Disable caching on the main page for anonymous users In-Scope Open None
Traffic 190843 Spammy events coming our way for sites such us https://ru.wikipedia.kim In-Scope Open None
Traffic 148134 OCSP Stapling for Intermediates In-Scope Open None
Traffic 91372 $wgMFAnonymousEditing = true is sometimes not respected: cache? In-Scope Open None
Traffic 184942 Deprecate python varnish cachestats In-Scope Open None
Traffic 192559 Establish timeline and methodology for upcoming deprecation of non-forward-secret ciphers and TLSv1.0 Screep Open None
Traffic 165764 Fully-redundant LVS clusters using Pybal per-service MED feature In-Scope Open None
Traffic 190993 Upgrade pybal-test instances to stretch In-Scope Open None
Traffic 175319 cp1066 unexplained 503 spikes In-Scope Open None
Traffic 124418 Investigate massive increase in htmlCacheUpdate jobs in Dec/Jan In-Scope Open None
Traffic 125170 Internal DNS resolver responds with NXDOMAIN for localhost AAAA In-Scope Open None
Traffic 102367 Migrate tools.wmflabs.org to https only (and set HSTS) In-Scope Open None
Traffic 194757 cp1068 memory correctable errors Screep Open None
Traffic 178535 decommission lvs400[1-4].ulsfo.wmnet In-Scope Open None
Traffic 136944 Set up LVS connection sync In-Scope Open None
Traffic 147209 etcd cluster has Raft Internal errors sporadically In-Scope Open None
Traffic 170605 Unable to render file from upload.wikimedia.org "Error 349 ERR_RESPONSE_HEADERS_MULTIPLE_CONTENT_DISPOSITION" In-Scope Open None
Traffic 180655 Phabricator and Gerrit: Improve the way that maintenance downtime is communicated to users. In-Scope Open None
Traffic 195327 Normalise the Accept-Language header for REST API requests Screep Open None
Traffic 161148 AuthDNS CM/CI refactor In-Scope Open None
Traffic 169765 pybal should automatically reconnect to etcd In-Scope Open None
Traffic 185239 Puppet hosts with signed certificate present on agent but not master In-Scope Open None
Traffic 153563 Consider switching to HTTPS for Wikidata query service links In-Scope Open None
Traffic 192280 sda failure in hydrogen.wikimedia.org Screep Open None
Traffic 66214 Define an official thumb API In-Scope Open None
Traffic 185350 Vet reliability of the response_size field for data analysis purposes In-Scope Open None
Traffic 148422 cp3009: memory scrubbing error In-Scope Open None
Traffic 96844 Update TLS/HTTP documentation on wikitech In-Scope Open None
Traffic 108580 HTTPS for internal service traffic In-Scope Open None
Traffic 168539 Unhandled pybal error: OpenSSL.SSL.Error - ssl handshake failure In-Scope Open None
Traffic 189290 Tune systemd journal rate limiting for PyBal In-Scope Open None
Traffic 194031 Setup a new PKI software as an alternative to the puppet CA for managing services certificates Screep Open None
Traffic 128188 Make CI run Varnish VCL tests In-Scope Open None
Traffic 128559 store.wikimedia.org HTTPS issues In-Scope Open None
Traffic 164259 Add VSL error counters to Varnishkafka stats In-Scope Open None
Traffic 171498 Implement machine-local forwarding DNS caches In-Scope Open None
Traffic 149847 RFC: Use content hash based image / thumb URLs In-Scope Open None
Traffic 118468 point wikilovesmonuments.org ns to wmf In-Scope Open None
Traffic 172124 PyBal Feature: progressive depooling strategy for monitored failures In-Scope Open None
Traffic 155314 Varnish does not cache Action API responses when logged in In-Scope Open None
Traffic 88861 wikipedia.lol In-Scope Open None
Traffic 36670 Check all wikis for inclusions of http resources on https In-Scope Open None
Traffic 166782 wikimediafoundation.org's language selector is confusing to most visitors who don't have accounts there In-Scope Open None
Traffic 118181 Planning for phasing out non-Forward-Secret TLS ciphers In-Scope Open None
Traffic 179050 setup bast4002/WMF7218 In-Scope Open None
Traffic 89838 Move proxy IP lists to META for Varnish XFF decoding In-Scope Open None
Traffic 193865 Enable numa_networking on all caches Screep Open None
Traffic 194962 Create and deploy a centralized letsencrypt service Screep Open None
Traffic 171470 Monitor DNS delegations In-Scope Open None
Traffic 78963 Support ESI for ResourceLoader In-Scope Open None
Traffic 129839 restrict upload cache access for private wikis In-Scope Open None
Traffic 144187 Better handling for one-hit-wonder objects In-Scope Open None
Traffic 128374 Sort out analytics service dependency issues for cp* cache hosts In-Scope Open None
Traffic 181315 Varnish HTTP response from app servers taking 160s (only 0.031s inside Apache) In-Scope Open None
Traffic 158599 Samsung Internet's desktop mode getting redirected to mobile site In-Scope Open None
Traffic 56783 Respect X-Forwarded-For only from trustworthy sources In-Scope Open None
Traffic 189305 cp3034: Uncorrectable Memory Error In-Scope Open None
Traffic 106517 upload.wikimedia.org returns HTTP status code 503 for truncated urls, not 404 In-Scope Open None
Traffic 109776 Tilerator should purge Varnish cache In-Scope Open None
Traffic 133895 Varnish configuration for mobile domains should be coherent with Apache configuration In-Scope Open None
Traffic 133410 Deploy TemplateStyles to WMF production In-Scope Open None
Traffic 109325 Outbound HTTPS for varnish backend instances In-Scope Open None
Traffic 194724 Deprecate `base::service_unit` in puppet Screep Open None
Traffic 167400 Disable serving unpatrolled new files to Wikipedia Zero users In-Scope Open None
Traffic 134893 Unhandled pybal error causing services to be depooled in etcd but not in lvs In-Scope Open None
Traffic 179027 Puppetize LVS interface IP sets per-DC for easy use in ferm rules In-Scope Open None
Traffic 152882 Many misc wikis lack mobile domains In-Scope Open None
Traffic 176388 pybal: race condition in alerts instrumentation In-Scope Open None
Traffic 105657 Expires header for load.php should be relative to request time instead of cache time In-Scope Open None
Traffic 78421 m.{project}.org portal/redirect consistency In-Scope Open None
Traffic 190992 prometheus: slow dashboards due to suboptimal query_range performance In-Scope Open None
Traffic 192206 Remove wildcard vhost for *.wikimedia.org Screep Open None
Traffic 167972 Respect host header in RESTBase, and redirect /rest_v1 to /rest_v1/ In-Scope Open None
Traffic 194965 gdnsd plugin support for ACME DNS challenges Screep Open None
Traffic 147162 upload.wikimedia.org returns HTTP 501 instead of 416 for non-satisfiable byte ranges In-Scope Open None
Traffic 143562 High number of failed inbound TFO connections in esams Mon-Fri In-Scope Open None
Traffic 183554 Unified certs bloat reduction? In-Scope Open None
Traffic 181569 rack/setup scs-eqsin.mgmt.eqsin.wmnet In-Scope Open None
Traffic 188776 Move Foundation Wiki to new URL when new Wikimedia Foundation website launches In-Scope Open None
Traffic 189250 WP Zero workarounds for eqsin In-Scope Open None
Traffic 177961 Upgrade LVS servers to stretch In-Scope Open None
Traffic 175636 prometheus -> grafana stats for per-numa-node meminfo In-Scope Open None
Traffic 175203 Implement stateless TCP balancing in our LVS servers In-Scope Open None
Traffic 149873 CentralNotice: Review and update Varnish caching for Special:BannerLoader In-Scope Open 2.0
Traffic 128409 Detect tools.wmflabs.org tools which are HTTP-only In-Scope Open None
Traffic 127482 Enable VCL source-DC switching via confd In-Scope Open None
Traffic 174432 Unclear LVS bandwidth graph in "load balancers" dashboard In-Scope Open None
Traffic 177742 Investigate Chrony as a replacement for ISC ntpd In-Scope Open None
Traffic 192437 Pybal support of configuration from the kubernetes API Screep Open None
Traffic 133178 RESTBase support for www.wikimedia.org missing In-Scope Open None
Traffic 163141 dbtree: make wasat a working backend and become active-active In-Scope Open None
Traffic 102178 Fix RESTBase support for wikitech.wikimedia.org In-Scope Open None
Traffic 164456 Migrate to nginx-light In-Scope Open None
Traffic 113817 Connect Hadoop records of the same request coming via different channels In-Scope Open None
Traffic 50133 ForeignAPIRepo wrongly returns non-protocol-relative URLs for original "thumbs" In-Scope Open None
Traffic 164609 Merge cache_misc into cache_text functionally In-Scope Open None
Traffic 131930 Set SPF (... -all) for toolserver.org In-Scope Open None
Traffic 127387 Split slash decoding from general percent normalization in Varnish VCL In-Scope Open None
Traffic 104681 HTTPS Plans (tracking / high-level info) In-Scope Open None
Traffic 133821 Content purges are unreliable In-Scope Open None
Traffic 107236 Switch port 80 to nginx on primary clusters In-Scope Open None
Traffic 192555 Begin execution of non-forward-secret ciphers deprecation Elaborated Open None
Traffic 159411 Uniform cluster nomenclature across puppet In-Scope Open None
Traffic 104442 Investigate better DNS cache/lookup solutions In-Scope Open None
Traffic 54253 Protocol-relative URLs are poorly supported or unsupported by a number of HTTP clients In-Scope Open None
Traffic 150479 Prometheus varnish metric churn due to VCL reloads In-Scope Open None
Traffic 81305 Make PyBal respect advertised BGP capabilities In-Scope Open None
Traffic 102848 Split GeoIP into a new component In-Scope Open None
Traffic 180407 Change "CP" cookie from subdomain to project level In-Scope Open None
Traffic 176875 Allow access to wdqs.svc.eqiad.wmnet on port 8888 In-Scope Open None
DBA 191977 remote ipmi doesn't work for es2013 Screep Done None
DBA 194103 Degraded RAID on db2067 Screep Done None
DBA 194197 Degraded RAID on db1073 Screep Done None
DBA 194155 Degraded RAID on db1073 Screep Done None
DBA 191892 Reduce locking contention on deletion of pages Screep Done None
DBA 191720 Degraded RAID on db2069 Screep Done None
DBA 193331 db1098 crashed and got rebooted Screep Done None
DBA 191193 Move masters away from codfw C6 Screep Done None
DBA 193747 Degraded RAID on db1063 Screep Done None
DBA 194766 https://tendril.wikimedia.org/ IPv6 doesn't work Screep Done None
DBA 191792 Rack and setup db1116 - db1123 Elaborated Done None
DBA 115982 Drop the tables old_growth, hitcounter, click_tracking, click_tracking_user_properties from enwiki, maybe other schemas In-Scope Done None
DBA 194698 Degraded RAID on db1065 Screep Done None
DBA 54932 Drop *_old database tables from Wikimedia wikis In-Scope Done None
DBA 194955 Degraded RAID on db1066 Screep Done None
DBA 194852 Possibly BBU issues on db1067 Screep Done None
DBA 184797 Move mariadb_maintenance away from terbium/wasat (mediawiki_maintenance) In-Scope Done None
DBA 184262 Decommission db1039 In-Scope Done None
DBA 193847 Move db1066 to row A Screep Done None
DBA 193835 Move db1067 to row C Screep Done None
DBA 194885 Degraded RAID on db1064 Screep Done None
DBA 191875 Deletion not working on English Wikipedia Screep Done None
DBA 182556 Decommission db1034 In-Scope Done None
DBA 193325 db2081 crashed/rebooted, probably due to hardware failure Screep Done None
DBA 153440 Create a full backup of all external storage records that would be easy to restore/setup a temporary delayed slave In-Scope Done None
DBA 187886 Decommission db2011 In-Scope Done None
DBA 127570 Rename be_x_oldwiki database to be_taraskwiki In-Scope Open None
DBA 162070 Cleanup or remove mysql puppet module; repurpose mariadb module to cover misc use cases In-Scope Open None
DBA 50930 Database replication problems - production and labs (tracking) In-Scope Open None
DBA 194781 rack/setup/install db209[45].codfw.wmnet (sanitarium expansion) Elaborated Open None
DBA 107610 Setup separate logical External Store for Flow in production In-Scope Open None
DBA 190704 Convert all sanitarium hosts to multi-instance and increase its reliability/redundancy In-Scope Open None
DBA 83609 script & docs to rename wiki databases In-Scope Open None
DBA 126252 Populate the wikishared db on all dbstores In-Scope Open None
DBA 175672 Make apache/maintenance hosts TLS connections to mariadb work In-Scope Open None
DBA 141968 Display lag on grafana (prometheus) and dbtree from pt-heartbeat instead (or in addition) of Seconds_Behind_Master In-Scope Open None
DBA 141255 Separate host lookup from the sql shell script In-Scope Open None
DBA 177779 Generate instance list of database hosts to be monitored automatically from exported resources In-Scope Open None
DBA 161754 eqiad: (2) hardware access request for labsdb1004 & 5 refresh In-Scope Open None
DBA 151999 Create script to monitor db dumps for backups are successful In-Scope Open None
DBA 143896 MySQL metrics monitoring In-Scope Open None
DBA 184805 Move some wikis to s5 In-Scope Open None
DBA 109179 Migrate MySQLs to use ROW-based replication In-Scope Open None
DBA 189107 DB meta task for next DC failover issues In-Scope Open None
DBA 166108 x1 master db1031: Faulty BBU In-Scope Open None
DBA 164834 In some database hosts, performance schema loses digest statistics In-Scope Open None
DBA 185084 Allow use of EtcdConfig to configure slave databases In-Scope Open None
DBA 152427 Create a check/calendar alert for MariaDB TLS certs In-Scope Open None
DBA 157702 Followup for TLS MariaDB server roll-out In-Scope Open None
DBA 134809 Apache <=> mariadb SSL/TLS for cross-datacenter writes In-Scope Open None
DBA 104699 Firewall configurations for database hosts In-Scope Open None
DBA 193224 Evaluate and decide the future of relational datastore at WMF after the upgrade of MariaDB 10.1 is finished Screep Open None
DBA 100501 mysql user and group should be a system user/group In-Scope Open None
DBA 112473 Better mysql monitoring for number of connections and processlist strange patterns In-Scope Open None
DBA 134476 Decommission old coredb machines (<=db1050) In-Scope Open None
DBA 193732 Decommission db1060 Screep Open None
DBA 184054 Decommission db1029 and db1031 In-Scope Open None
DBA 141547 Setup automatic failover for misc database servers In-Scope Open None
DBA 194780 rack/setup/install db112[45].eqiad.wmnet (sanitarium expansion) Elaborated Open None
DBA 171071 Perform testing for TLS effect on connection rate In-Scope Open None
DBA 160731 Decom db1048 (BBU Faulty - slave lagging) In-Scope Open None
DBA 178877 operations/software repo: flake8 check In-Scope Open None
DBA 119154 Move echo tables from local wiki databases onto extension1 cluster for mediawikiwiki, metawiki, and officewiki In-Scope Open None
DBA 195228 db2064 crashed and totally broken - decommission it Screep Open None
DBA 165677 Create a backend check for pybal to monitor the MySQL protocol being up In-Scope Open None
DBA 163339 pdu phase inbalances: ps1-a3-codfw, ps1-c6-codfw, & ps1-d6-codfw In-Scope Open None
DBA 161755 eqiad: (2) hardware access request for labsdb1006 & 7 refresh In-Scope Open None
DBA 145072 Create a script to regenerate prometheus mysqld exporter listing that works with puppetdb In-Scope Open None
DBA 194118 Decommission db1055 Screep Open None
DBA 157359 labsdb1006/1007 (postgresql) maintenance In-Scope Open None
DBA 128821 reclaim and return all cisco servers In-Scope Open None
DBA 165674 Investigate slow servermon updating queries on db1016 In-Scope Open None
DBA 165625 Evaluate future of wmf puppet module "mysql" In-Scope Open None
DBA 119626 Eliminate SPOF at the main database infrastructure In-Scope Open None
DBA 133523 [RFC] improve parsercache replication and sharding handling In-Scope Open None
DBA 135851 Preserve InnoDB table auto_increment on restart In-Scope Open None
DBA 176532 Gerrit is failing to connect to db on gerrit2001 thus preventing systemd from working In-Scope Open None
DBA 112282 Multiple pages with no revisions In-Scope Open None
Software Development 144169 Flake8 for python files without extension in puppet repo In-Scope Open None
Software Development 191300 Debmonitor: deploy the agent across the fleet Screep Open None
Software Development 184435 Puppet tox: properly lint both Py2 and Py3 files In-Scope Open None
Software Development 182028 DNS repo: add CI checks for obvious configuration errors In-Scope Open None
Software Development 191298 Release and deploy Debmonitor (patch management software) [Technology Goal 2017-18_Q4] Screep Open None
Software Development 157133 Consider adding a --skip-conftool option to puppet-merge In-Scope Open None
Software Development 191299 Debmonitor: deploy the service in production Screep Open None
Software Development 177385 Upgrade Cumin masters to stretch In-Scope Open None
Software Development 167504 New tool to track package updates/status for hosts and images (debmonitor) In-Scope Open None
Software Development 159045 Update Puppet repo code that uses maniphest.update and maniphest.createtask conduit api In-Scope Open None
Software Development 155705 confctl: log to SAL even if the selection doesn't match any host In-Scope Open None
Software Development 150560 More verbose messages from service-checker-swagger In-Scope Open None
Software Development 154776 Puppet compiler: order resources for easy comparison between hosts In-Scope Open None
Software Development 152950 E901 SyntaxError: invalid syntax is wrongly raised on using python's abc by jenkins python CI linter In-Scope Open None
Software Development 164587 cumin could use randomization/splay options In-Scope Open None
Software Development 148494 Add shell scripts CI validations In-Scope Open None
Software Development 157001 Puppet compiler: abort on git rebase conflict In-Scope Open None
Software Development 157002 Puppet compiler: re-add the concurrency option NUM_THREADS In-Scope Open None
Hardware Requests 175150 Decommission stat1003.eqiad.wmnet In-Scope Done None
Hardware Requests 182991 New WDQS clusters eqiad + codfw In-Scope Done None
Hardware Requests 159996 decom fluorine In-Scope Done None
Hardware Requests 191359 decom spare server iodine Elaborated Done None
Hardware Requests 191363 decom spare server nobelium/wmf4543 Elaborated Done None
Hardware Requests 192185 request to assign spare systems as terbium equivalent Screep Done None
Hardware Requests 175595 decommission wdqs100[12] In-Scope Done None
Hardware Requests 175093 Decommission osmium.eqiad.wmnet In-Scope Done None
Hardware Requests 188075 eqiad/codfw: (4)+(4) hardware access request for videoscalers In-Scope Done None
Hardware Requests 181936 Give misc dump crons their own host In-Scope Open None
Hardware Requests 181264 Refresh or replace oxygen In-Scope Open None
Hardware Requests 178392 Replacement hardware for cumin masters In-Scope Open None
Hardware Requests 139775 eqiad: add all spare network switches to hardware spares tracking In-Scope Open None
Other Operations 184592 Setup reply via email in discourse-mediawiki.wmflabs.org In-Scope Cut None
Other Operations 192472 Requesting access to analytics servers for mepps Screep Done None
Other Operations 194461 Delete "ltakb-admins" mailing list Screep Done None
Other Operations 162013 etcd cluster in codfw has raft consensus issues In-Scope Done None
Other Operations 192531 puppetdb does not start up on reboot Screep Done None
Other Operations 192115 sudoer access for pnorman on maps servers Screep Done None
Other Operations 172538 rack/setup/install labvirt10(19|20).eqiad.wmnet In-Scope Done None
Other Operations 146285 Switch mwscript from Zend PHP5 to default php alternative (e.g. HHVM or PHP7) In-Scope Done None
Other Operations 191523 Request to add Matthias Geisler to the ldap/wmde group Screep Done None
Other Operations 195216 Interface errors on pfw3a-codfw:xe-0/0/17 Screep Done None
Other Operations 189674 Create wikibaseug mailing list Screep Done None
Other Operations 181121 Kernels errors on ganeti1005- ganeti1008 under high I/O In-Scope Done None
Other Operations 122144 Move most (all?) exim personal aliases to OIT In-Scope Done None
Other Operations 192983 Site: 4 VM request for pdf-render/proton Elaborated Done None
Other Operations 165781 rack/setup/install labcontrol100[34] In-Scope Done None
Other Operations 149845 Something is wrong with installer root disk stuff In-Scope Done None
Other Operations 193579 Update and move labnet1001/1002 Screep Done None
Other Operations 181728 Stop using jmx_exporter deployed via scap in favour of Debian package In-Scope Done None
Other Operations 193254 Global renames get stuck at metawiki Screep Done None
Other Operations 127549 move travel related aliases to OIT In-Scope Done None
Other Operations 194798 furud: disconnect furud-array[3-7]; connect furud-array[1-2] Screep Done None
Other Operations 143931 Update ICU version to 55.1 In-Scope Done None
Other Operations 192684 add arlo and scott to parsoid releasers admin group Elaborated Done None
Other Operations 194418 Certain graphite data directories should be backed up Screep Done None
Other Operations 191767 Important critical Etherpad release – 1.6.4 Screep Done None
Other Operations 187174 Remove darkbluesky from being mailinglist admin on arbcom-ko (cannot access anymore) Screep Done None
Other Operations 192686 Beta Cluster sends password reset mails with prod address Screep Done None
Other Operations 191453 grant thcipriani RelEng root on contint1001 Screep Done None
Other Operations 191704 Access to the deployment hosts for Imarlier Screep Done None
Other Operations 191673 Update SSH key in production hosts for @Sharvaniharan Screep Done None
Other Operations 190081 rack/setup/install ms-be104[0-3].eqiad.wmnet In-Scope Done None
Other Operations 193891 rigel.frack.codfw.wmnet (fundraising codfw bastion) will not boot after a power cycle Screep Done None
Other Operations 191467 Retire nitrogen and nihal ganeti VMs Screep Done None
Other Operations 150532 Upgrade qemu on ganeti clusters to 2.8 In-Scope Done None
Other Operations 192071 Upgrade deployment-prep appserver fleet to Debian Stretch (using HHVM) Elaborated Done None
Other Operations 194445 Give Seddon access to the analytics cluster Screep Done None
Other Operations 144006 Move the MW Beta appservers to Debian In-Scope Done None
Other Operations 193422 Clean up deprecated shared virtualenv directories Elaborated Done None
Other Operations 191972 Scap sync-file failing for deploy1001.eqiad.wmnet Screep Done None
Other Operations 193118 Upgrade OTRS to 5.0.27 Screep Done None
Other Operations 189822 Replace 5 Samsung SSD 850 devices w/ 4 1.6T Intel or HP SSDs In-Scope Done None
Other Operations 165345 decommission indium In-Scope Done None
Other Operations 179271 Disk space checks complaining on docker build hosts when building containers In-Scope Done None
Other Operations 160892 Close arbcom-ko mailing list Screep Done None
Other Operations 192124 Deploy Scap 3.8.0 to production Screep Done None
Other Operations 191308 Access to stat100x and notebook1003.eqiad.wmnet for Jonas Kress Screep Done None
Other Operations 150672 Provide an archive endpoint for older Parsoid debs (on releases.wikimedia.org or elsewhere) In-Scope Done None
Other Operations 194426 mw2139 failed to boot - hardware check Screep Done None
Other Operations 194404 Add Michael Holloway (Reading Infrastructure) to maps admin groups Screep Done None
Other Operations 194466 Reset admin password for Wikipedia-FR-Wikimag mailinglist Screep Done None
Other Operations 192256 Add Tim_WMDE to the ldap/wmde group Screep Done None
Other Operations 165779 rack/setup/install labnet100[34] In-Scope Done None
Other Operations 191321 Remove deprecated hosts from ORES scap config Screep Done None
Other Operations 192902 Broken memory/CPU on mw1275 Screep Done None
Other Operations 192763 Create a prometheus exporter for mcrouter Screep Done None
Other Operations 194597 Mass subscription attempt to a mailing list from same domain (aol and yahoo) Screep Done None
Other Operations 177622 Multiple systems in ulsfo 1.22 showing PSU failures In-Scope Done None
Other Operations 194841 Degraded raid in labnet1002 Screep Done None
Other Operations 151702 API cluster failure / OOM In-Scope Done None
Other Operations 159354 Move coal from graphite#001 nodes to webperf#001 In-Scope Done None
Other Operations 192893 Access to Google Search Console for Go Fish Digital Screep Done None
Other Operations 193653 labtestvirt2002 eth1 showing no carrier Screep Done None
Other Operations 191177 data retrieval/write issues via NFS on dumpsdata1001, impacting some dump jobs Screep Done None
Other Operations 194390 EQIAD & CODFW: 1 VM in each data center for xhprof/xhgui/other profiling tools Screep Done None
Other Operations 181071 Cache ORES virtualenv within versioned source In-Scope Done None
Other Operations 194550 Access to usergroups for Marshall Miller Screep Done None
Other Operations 194283 Clean up graphite1001.eqiad.wmnet, now that coal has been moved Elaborated Done None
Other Operations 193651 labstore1003 SMART failure Screep Done None
Other Operations 156924 Allow integration of data from etcd into the MediaWiki configuration In-Scope Done None
Other Operations 193798 change hostname label for mw1297 to mwmaint1001 Screep Done None
Other Operations 194537 Shut down and delete the devnations-l mailing list Screep Done None
Other Operations 193660 Merge one-line puppet fix Screep Done None
Other Operations 170365 move legal-tm-vio alias to OIT In-Scope Done None
Other Operations 193771 Requesting access to Logstash for jbennett Screep Done None
Other Operations 194575 Archive "wiki-offline-reader-l" Screep Done None
Other Operations 192139 Remove monthly run of updateArticleCount.php Screep Done None
Other Operations 192287 [Bug] Beta cluster page summary endpoint sometimes reponds with 5xx Screep Done None
Other Operations 172624 Rename (recreate) mailing list for Toolforge-standards-committee Screep Done None
Other Operations 169249 /usr/local/bin/xenon-generate-svgs and flamegraph.pl cronspam In-Scope Done None
Other Operations 192799 Shall we set up a mailing list or something for the Wikibase community? Screep Done None
Other Operations 192159 Requesting access to deployment for pmiazga Screep Done None
Other Operations 191356 Request for "administrator" rights on beta cluster Screep Done None
Other Operations 191478 Requesting access to shell (snapshot, dumpsdata) for springle Screep Done None
Other Operations 152073 Check concurrency/retry/timeout limits and syncronize those between services In-Scope Done None
Other Operations 194010 mailing list request for USJP Wiki Club Screep Done None
Other Operations 137397 revisit swift (sys)logging In-Scope Done None
Other Operations 192343 Trim long output from check_prometheus_metric Screep Done None
Other Operations 187821 Choose a server for the chromium-render service In-Scope Done None
Other Operations 193919 provide proxysql for stretch, add package to puppet Screep Done None
Other Operations 188392 package prometheus-rabbitmq-exporter for Debian jessie In-Scope Done None
Other Operations 192721 Degraded RAID on ms-be2034 Screep Done None
Other Operations 193238 Kafka API negotiation errors on kafka main brokers Screep Done None
Other Operations 192060 Request to add Tarrow to the ldap/wmde group Screep Done None
Other Operations 162857 Some Core availability Catchpoint tests might be more expensive than they need to be In-Scope Done None
Other Operations 193793 Icinga SMART check returns OK when not getting data Screep Done None
Other Operations 193342 Request to create a mailing list for Wikimedia Niger Delta Screep Done None
Other Operations 192279 Kibana fails to load when using short URLs to share dashboard Screep Done None
Other Operations 186824 Start a new email list called 'Collaboration-Team' (and discontinue the old 'E2' mailing list) Screep Done None
Other Operations 128716 Make icinga-wm report Tools homepage check at #wikimedia-labs, too In-Scope Open None
Other Operations 142984 Review lists of config/sysctl recommendations by "kernel self-protection project" In-Scope Open None
Other Operations 110240 [Discussion] Consider validating JSON schemas when running x-ample tests? In-Scope Open None
Other Operations 162029 Migrate all jessie hosts to Linux 4.9 In-Scope Open None
Other Operations 176370 Migrate to PHP 7 in WMF production In-Scope Open None
Other Operations 177099 Large number of "A page you created was linked on Wikidata" emails to one recipient in short period of time In-Scope Open None
Other Operations 180853 Bring discourse.mediawiki.org to production In-Scope Open None
Other Operations 130590 Have dedicated master nodes for elasticsearch In-Scope Open None
Other Operations 151045 Extending Yubico 2FA for production use (meta bug) In-Scope Open None
Other Operations 116767 limit the impact of heavy/large graphite queries In-Scope Open None
Other Operations 170108 Operations Q1 goal: Streamlined Service Delivery In-Scope Open None
Other Operations 131832 Unable to restore file that has a very large file size In-Scope Open None
Other Operations 163673 Some swift disks wrongly mounted on 5 ms-be hosts In-Scope Open None
Other Operations 126574 puppet should try to mount all mountable swift filesystems In-Scope Open None
Other Operations 144539 Remove /srv/deployment/wdqs/wdqs/rules.log symlink In-Scope Open None
Other Operations 183146 Monitor resource usage on a per-cgroup basis In-Scope Open None
Other Operations 148986 Firewall sets not being loaded post-reboot due to a @resolve race on jessie In-Scope Open None
Other Operations 175885 Toolforge's static webserver broken by Puppet changes and stale nginx packages In-Scope Open None
Other Operations 161904 decommission backup4001 In-Scope Open None
Other Operations 94819 Audit racktables In-Scope Open None
Other Operations 184714 Puppet fail to properly refresh Icinga In-Scope Open None
Other Operations 178575 Add require_package() variant with repository component to wmflib In-Scope Open None
Other Operations 86546 graphite-web logs are not rotated In-Scope Open None
Other Operations 152100 should we make privatewiki list available to puppet without maintaining two lists? In-Scope Open None
Other Operations 119679 Rewrite http://download.wikimedia.org/mediawiki/ -> https://releases.wikimedia.org/mediawiki in less than 3 redirects In-Scope Open None
Other Operations 193738 dbstore1002 disk 5 not healthy Screep Open None
Other Operations 110171 Alert when ES indexes are freezed for more than 30 minutes In-Scope Open None
Other Operations 141704 Storage backend errors on commons when deleting/restoring pages In-Scope Open None
Other Operations 84163 Fix CirrusSearch monitoring In-Scope Open None
Other Operations 141897 Review new service 'pre-deployment to production' checklist In-Scope Open None
Other Operations 177958 Decommission ocg1001-3 In-Scope Open None
Other Operations 193628 tungsten disk 1 and 8 SMART failure Screep Open None
Other Operations 182699 Use firmware-enriched Debian installation images In-Scope Open None
Other Operations 188947 Create an LVS endpoint for jobrunners on videoscalers In-Scope Open None
Other Operations 138017 Improve automation around Maps servers In-Scope Open None
Other Operations 78135 Provide a pxe-bootable rescue image In-Scope Open None
Other Operations 191360 decom spare server lawrencium/WMF3542 Elaborated Open None
Other Operations 152632 Explore hosting the multimedia commons use case In-Scope Open None
Other Operations 171191 Should puppet auto-restart slapd? In-Scope Open None
Other Operations 119660 Set up LVS for labs dns recursors In-Scope Open None
Other Operations 115899 Move scap target configuration to etcd In-Scope Open None
Other Operations 141524 eventbus should send statsd in batches In-Scope Open None
Other Operations 86552 Monitor and alarm on SMART attributes In-Scope Open None
Other Operations 182702 Debian Jessie reimage/install ends up in kernel panic with 8.10 netboot image In-Scope Open None
Other Operations 161918 videoscalers (mw1168, mw1169) - high load / overheating In-Scope Open None
Other Operations 56515 Apply editing rate limits for all users In-Scope Open None
Other Operations 186069 Icinga: page in case all MediaWiki are throwing 5xx In-Scope Open None
Other Operations 191199 Page allocation stalls on scb1001, scb1002 Screep Open None
Other Operations 146841 Reach out to Google about @yahoo.com emails not reaching gmail inboxes (when sent to mailing lists) In-Scope Open None
Other Operations 137616 Epic: cultivating the Maps garden In-Scope Open None
Other Operations 160071 Add slabinfo prometheus exporter In-Scope Open None
Other Operations 194036 mw1230 sdb "Raw_Read_Error_Rate" SMART Screep Open None
Other Operations 187257 puppetdb4: systemd config review In-Scope Open None
Other Operations 149287 Heating alerts for mw servers in eqiad In-Scope Open None
Other Operations 55457 setup a DB backed parser cache In-Scope Open None
Other Operations 177195 Reduce technical debt in metrics monitoring In-Scope Open None
Other Operations 129847 conftool-merge should report which node is setting attributes for In-Scope Open None
Other Operations 124413 confctl should provide tags information after writing data In-Scope Open None
Other Operations 186073 Rack/setup frmon1001 In-Scope Open None
Other Operations 164341 Decommission old memcached hosts - mc1001->mc1018 In-Scope Open None
Other Operations 167422 Monitoring: add link to graph for Icinga timeseries alarms In-Scope Open None
Other Operations 181546 Let the ORES application set log severity, not uWSGI In-Scope Open None
Other Operations 120532 Use user-specific passwords for accessing EventLogging database In-Scope Open None
Other Operations 46791 [[wikitech:Server_admin_log]] should not rely on freenode irc for logmsgbot entries In-Scope Open None
Other Operations 143556 Setting up grafana should also setup Anonymous read-only access for the default org In-Scope Open None
Other Operations 191315 Cassandra Graphite metrics space usage audit and cleanup Screep Open None
Other Operations 193664 Knock down puppet 4 deprecation warnings Screep Open None
Other Operations 150020 Refactor puppet-postgresql module to use custom types In-Scope Open None
Other Operations 193420 Decommission hafnium Screep Open None
Other Operations 40860 security@mediawiki.org : Create a public key and publish it on the public key servers In-Scope Open None
Other Operations 156570 Investigate issues with wikitech-static.wikimedia.org In-Scope Open None
Other Operations 144431 RESTBase k-r-v as Cassandra anti-pattern In-Scope Open None
Other Operations 17000 Special:Import error: "Import failed: Could not open import file" In-Scope Open None
Other Operations 156140 Lots of hosts with hyperthreading disabled In-Scope Open None
Other Operations 179078 mpt raid controller not detected as fact on maps-test2* In-Scope Open None
Other Operations 135385 investigate carbon-c-relay stalls/drops towards graphite2002 In-Scope Open None
Other Operations 194998 Create custom deployment-prep role that allows editing of Designate records only Elaborated Open None
Other Operations 174720 letsencrypt::cert::integrated and non-http servers In-Scope Open None
Other Operations 194171 rdb2002 correctable memory errors Screep Open None
Other Operations 193910 Build and upload jenkins-debian-glue_0.18.4-wmf3 for jessie Screep Open None
Other Operations 142205 use granularity (g=) restrictions for wikimedia.org fundraising DKIM records In-Scope Open None
Other Operations 170628 Jessie rsvg/cairo can't render specific SVG file on Commons In-Scope Open None
Other Operations 189065 Outbound mail from Greenhouse is broken In-Scope Open None
Other Operations 122825 Service Ownership and Maintenance In-Scope Open None
Other Operations 142827 Enforce reference to Phabricator task for all commits to modules/admin/data/data.yaml In-Scope Open None
Other Operations 123560 investigate rsync between dcs with encryption In-Scope Open None
Other Operations 183814 Degraded RAID on bast3002 In-Scope Open None
Other Operations 191357 decom silver/WMF3434 Elaborated Open None
Other Operations 191921 mwscript rebuildLocalisationCache.php takes 40 minutes Screep Open None
Other Operations 185195 tmpreaper doesn't play along with PrivateTmp systemd units In-Scope Open None
Other Operations 174431 Upgrade mw* servers to Debian Stretch (using HHVM) In-Scope Open None
Other Operations 136312 Encrypt syslog traffic In-Scope Open None
Other Operations 187987 Upgrade to Prometheus 2.x In-Scope Open None
Other Operations 159524 backup space is used unwisely In-Scope Open None
Other Operations 183565 Fix regex.yaml single-regex issue In-Scope Open None
Other Operations 190568 Reimage both phab1001 and phab2001 to stretch In-Scope Open None
Other Operations 195306 Degraded RAID on elastic2020 Screep Open None
Other Operations 189629 rename role::xenon In-Scope Open None
Other Operations 182016 Decommission server zinc In-Scope Open None
Other Operations 146355 Replace etcd internal auth mechanism with a frontend proxy In-Scope Open None
Other Operations 175625 scs-c1-eqiad unresponsive In-Scope Open None
Other Operations 151009 Provide authenticated access to Prometheus native web interface In-Scope Open None
Other Operations 185644 Switch phabricator from using apache to nginx In-Scope Open None
Other Operations 165618 Audit / document reasons for not enabling HT? In-Scope Open None
Other Operations 129180 Preserve SSH host key when re-imaging hosts In-Scope Open None
Other Operations 140594 svn.wikimedia.org redirects to Diffusion main page, hence hard to find e.g. "flexbisonparse" In-Scope Open None
Other Operations 94951 Enable the usage of `hhvm -m debug --debug-host ::1` from mw1017 so developers can step through code (think gdb) in production to see what is going wrong. In-Scope Open None
Other Operations 191364 decom spare server osmium/wmf4546 Elaborated Open None
Other Operations 194966 disk usage increase on maps servers Screep Open None
Other Operations 191355 decom niobium/WMF3428 Elaborated Open None
Other Operations 180641 reinstall RT server with private IP and stretch In-Scope Open None
Other Operations 186625 apply hostname labels to bast1002/WMF4749 In-Scope Open None
Other Operations 187754 Figure out why HHVM isn't using error_document404 setting In-Scope Open None
Other Operations 187194 zotero translation server: code stewardship request In-Scope Open None
Other Operations 116063 Hardware Automation Workflow - Overall Tracking In-Scope Open None
Other Operations 194964 Connect or troubleshoot eth1 on labvirt1019 and labvirt1020 Screep Open None
Other Operations 134237 Graphoid returns a 400 on MW API time-out In-Scope Open None
Other Operations 88997 Improve graphite failover In-Scope Open None
Other Operations 177891 Update and use php-wikidiff2 1.5.1 & MovedParagraphDetectionCutoff in production In-Scope Open None
Other Operations 157972 Puppet fails only once when restarting ferm is not successful In-Scope Open None
Other Operations 181750 decommission mobile 1004 and mobile1005 In-Scope Open None
Other Operations 116627 Include 5xx numbers in fluorine fatalmonitor In-Scope Open None
Other Operations 152724 Current state and next steps for RESTBase storage In-Scope Open None
Other Operations 155761 DNS repo: add Jenkins job to ensure there are no duplicates In-Scope Open None
Other Operations 134811 Consider REST with SSL (HyperSwitch/Cassandra) for session storage In-Scope Open None
Other Operations 113792 Change LDAP cn to something more useful (was Rename "Dzahn" to "Daniel Zahn" in Gerrit) In-Scope Open None
Other Operations 184061 SRE 2017-18 Q3 goal Cleanup esams and refresh servers and infrastructure (tracking) In-Scope Open None
Other Operations 132216 Setting up bulk proxies pointing to a multiwiki mediawiki-vagrant setup running on a labs vm In-Scope Open None
Other Operations 97909 Upgrade jobrunners to redis 2.8 In-Scope Open None
Other Operations 186734 Clean up redundant ORES celery_workers defaults In-Scope Open None
Other Operations 154619 Export ipsec counters as Prometheus metrics In-Scope Open None
Other Operations 189801 setup backup1001.eqiad.wmnet In-Scope Open None
Other Operations 187658 Setup cron for foreachwikiindblist all-labs.dblist extensions/AbuseFilter/maintenance/purgeOldLogIPData.php on Beta In-Scope Open None
Other Operations 97524 ocg alarm ocg_job_status_queue 'flapping' In-Scope Open None
Other Operations 172479 Collect error logs from jobchron/jobrunner services in Logstash In-Scope Open None
Other Operations 193072 TTS server deployment strategy Screep Open None
Other Operations 165136 Ferm rules for labstore NFS hosts In-Scope Open None
Other Operations 176774 Reimage cobalt as stretch In-Scope Open None
Other Operations 183585 Rack/cable/configure asw2-b-eqiad switch stack In-Scope Open None
Other Operations 164123 tools-k8s-master-01 has two floating IPs In-Scope Open None
Other Operations 182832 Apache on phab1001 is gradually leaking worker processes which are stuck in "Gracefully finishing" state In-Scope Open None
Other Operations 119274 Check incoming requests to secure.wm.o In-Scope Open None
Other Operations 150871 [EPIC] (Proposal) Replicate core OCG features and sunset OCG service In-Scope Open None
Other Operations 138685 notebook1001 shown as DOWN in icinga, due to firewall rules In-Scope Open None
Other Operations 195252 Update prometheus-varnish-exporter on debian to 1.4 Screep Open None
Other Operations 110169 Monitor redis memory/disk usage In-Scope Open None
Other Operations 111653 Encrypt all the things In-Scope Open None
Other Operations 170298 sshd stretch puppet support In-Scope Open None
Other Operations 176335 logs sent to logstash are lost when the elasticsearch cirrus cluster is unavailable In-Scope Open None
Other Operations 169564 MD RAID: remove mdadm daily check In-Scope Open None
Other Operations 123276 URL parameters do not work with pages that have "?" in their names In-Scope Open None
Other Operations 167035 stretch acct monthly cron will spam when /var/log/wtmp.1 doesn't exist In-Scope Open None
Other Operations 184561 Modernize Puppet Configuration Management (2017-18 Q3 Goal) In-Scope Open None
Other Operations 142991 Enable "upload by url" feature at zhwiki In-Scope Open None
Other Operations 194669 Provide a mean to mass discard/reject subscription requests on Wikimedia mailing lists Screep Open None
Other Operations 141959 Moving network::external to hiera broke much of labs In-Scope Open None
Other Operations 194186 rack/setup/install cloudelastic100[1-4].eqiad.wmnet systems Screep Open None
Other Operations 122917 Provide a good download service of dumps from Wikimedia In-Scope Open None
Other Operations 129188 mw2212 unresponsive In-Scope Open None
Other Operations 188453 Google Search Console access for Search Platform team In-Scope Open None
Other Operations 165105 Some requests for DOIs are failing or very slow; if we have a DOI and the request is taking too long, just use CrossRef data instead. In-Scope Open None
Other Operations 169322 Research whether it makes sense to have OTRS installation in an HA setup In-Scope Open None
Other Operations 172584 Securing external binaries run by MediaWiki In-Scope Open None
Other Operations 170740 PuppetDB misbehaving on 2017-07-15 In-Scope Open None
Other Operations 137939 Increase frequency of OSM replication In-Scope Open None
Other Operations 104774 Publishing translations for central notice banners fails In-Scope Open 2.0
Other Operations 184066 Procure and install new PDUs In-Scope Open None
Other Operations 163068 More missing 'original' files on Commons In-Scope Open None
Other Operations 94457 Install nodejs, nginx and other dependencies on francium In-Scope Open None
Other Operations 119401 Untangle labs/production roles from labs/instance roles In-Scope Open None
Other Operations 150917 Remove deprecated features from book creator UI In-Scope Open None
Other Operations 193733 Move dispatching of wikidata to a dedicated node Screep Open None
Other Operations 185055 Stack overflow when Redis is down In-Scope Open None
Other Operations 185298 xfs_db blocked / timeout on ms-be2023 In-Scope Open None
Other Operations 182274 Create custom per-job metric reporters capability In-Scope Open None
Other Operations 192547 Improve remote IPMI monitoring Screep Open None
Other Operations 179022 Backport firejail 0.9.52 for use on Wikimedia appservers In-Scope Open None
Other Operations 171157 Monitor internal CA expirations In-Scope Open None
Other Operations 158429 Switch to predictable network interface names? In-Scope Open None
Other Operations 95801 Allow customizing the alert message from graphite In-Scope Open None
Other Operations 135128 Turn on etcd TLS for intra-cluster communications In-Scope Open None
Other Operations 190086 Decommission old server wmf4077 In-Scope Open None
Other Operations 194558 Enable CAPTCHA on mailman instances Screep Open None
Other Operations 144933 Cleanup debconf handling in mailman puppet setup In-Scope Open None
Other Operations 163354 Find a way to verify mediawiki-config IPs ahead of datacenter switchovers In-Scope Open None
Other Operations 166233 Update redis puppet class to support stretch In-Scope Open None
Other Operations 192551 atop on stretch overloading a host Screep Open None
Other Operations 131748 Refresh the appservers puppet code/configs In-Scope Open None
Other Operations 162037 Use SSL certificates with discovery entry for elasticsearch In-Scope Open None
Other Operations 191362 decom promethium/WMF3571 Elaborated Open None
Other Operations 158434 Phabricator: Make sure phabricator works properly including our puppet roles on jessie In-Scope Open None
Other Operations 95052 Make ircecho much better In-Scope Open None
Other Operations 156544 Create backups of Wikimedia content in diverse geographic places In-Scope Open None
Other Operations 156955 Standardizing our partman recipes In-Scope Open None
Other Operations 171619 ORES should use a git large file plugin for storing serialized binaries In-Scope Open None
Other Operations 179371 Move deployment-prep redis instances to stretch In-Scope Open None
Other Operations 179181 Puppet4: hiera() can only be called using the 4.x function API. In-Scope Open None
Other Operations 192771 mcrouter production architecture Screep Open None
Other Operations 193655 rack/setup/install labstore1008 & labstore1009 Screep Open None
Other Operations 149421 Long running mediawiki web requests impacts service availability, specially databases In-Scope Open None
Other Operations 176153 Create affcom-staff email account In-Scope Open None
Other Operations 177196 Port non-deprecated Diamond collectors to Prometheus In-Scope Open None
Other Operations 184462 Serve one production service via Kubernetes In-Scope Open None
Other Operations 167091 Elasticsearch errors about BulkShardRequest In-Scope Open None
Other Operations 148843 GPU upgrade for stats machine In-Scope Open None
Other Operations 133744 Epic: switch Maps to production status In-Scope Open None
Other Operations 159750 E-mail for people in different OIT LDAP object unit In-Scope Open None
Other Operations 177197 Export Prometheus-compatible JVM metrics from JVMs in production In-Scope Open None
Other Operations 116580 monitor postgresql replication status In-Scope Open None
Other Operations 102575 document graphite failover/backfill procedures In-Scope Open None
Other Operations 191659 Configure a threshold for earlier notification of /srv/cassandra/instance-data Screep Open None
Other Operations 93138 Procure hardware for Sentry In-Scope Open None
Other Operations 167412 host-vmem.erb is doing operations that make no sense In-Scope Open None
Other Operations 140942 Tracking: Monitoring and alerts for "business" metrics In-Scope Open None
Other Operations 184936 install/designate other machine as esams bastion In-Scope Open None
Other Operations 168767 Monitor PostgreSQL connection slots In-Scope Open None
Other Operations 170995 Setup a mirror for R language dependencies (CRAN) In-Scope Open 10.0
Other Operations 120585 Make l10nupdate user a system user In-Scope Open None
Other Operations 36947 Incorrect text positioning in SVG rasterization (scale/transform; font-size; kerning) In-Scope Open 0.0
Other Operations 193394 Degraded RAID on wasat Screep Open None
Other Operations 163667 Fix UIDs for deployment server users In-Scope Open None
Other Operations 119846 Redirect revisions from svn.wikimedia.org to https://phabricator.wikimedia.org/rSVN In-Scope Open None
Other Operations 125085 Split the API MediaWiki appserver pool into two external/internal pools In-Scope Open None
Other Operations 119718 Make it easier to ban misbehaving dashboards from graphite In-Scope Open None
Other Operations 193155 IPMI Audit 2018-04 Screep Open None
Other Operations 121610 system users with UIDs > 500 In-Scope Open None
Other Operations 150300 icinga notification if elevated writing to badpass.log In-Scope Open None
Other Operations 156398 Decommission or repair old asw-c2-eqiad In-Scope Open None
Other Operations 127825 Re-add intel-microcode In-Scope Open None
Other Operations 191351 decom vanadium/WMF3291 Elaborated Open None
Other Operations 170150 Evaluate Grafana's LDAP group options and deprecate grafana-admin if possible In-Scope Open None
Other Operations 64987 librsvg misinterpret quoted font family names that contain whitespaces In-Scope Open None
Other Operations 194653 Ban clients of WDQS which don't follow throttling directives for some time Screep Open None
Other Operations 133179 Redis monitoring needs to be improved In-Scope Open None
Other Operations 187434 Include apache_exporter in puppet module apache In-Scope Open None
Other Operations 133318 High levels of PoolCounter errors should trigger alerts In-Scope Open None
Other Operations 184832 Decommission labsdb1001 and labsdb1003 In-Scope Open None
Other Operations 175206 2017/18 Annual Plan Program 8: Multi-datacenter support In-Scope Open None
Other Operations 98831 Honor DNT header for access logs & varnish logs In-Scope Open None
Other Operations 181678 Plan migration of ORES repos to git-lfs In-Scope Open None
Other Operations 150356 Wikidata Query Service is overly verbose toward logstash In-Scope Open None
Other Operations 114446 move human users out of UID range for system accounts In-Scope Open None
Other Operations 159480 Decommission bast3001 In-Scope Open None
Other Operations 171048 Eventbus does not handle gracefully changes in DNS recursors In-Scope Open None
Other Operations 152782 Kibana functionality missing after upgrade: histograms In-Scope Open None
Other Operations 56713 Non-NDA users cannot access graphite.wikimedia.org In-Scope Open None
Other Operations 188985 https://meta.wikimedia.org/wiki/Special:Contact/Stewards is being abused by spammers In-Scope Open None
Other Operations 128715 Add other Tools administrators to the Icinga notification group In-Scope Open None
Other Operations 175876 document all scs connections In-Scope Open None
Other Operations 170474 Decommisson and store old row D network gear. In-Scope Open None
Other Operations 191648 uwsgi::app sorts config keys, but the .ini file behavior depends on order Screep Open None
Other Operations 84845 improve cron spam visibility In-Scope Open None
Other Operations 192610 prometheus on bast3002 misbehaving Screep Open None
Other Operations 140270 Determine a core set or a checklist of permissions for deployment purpose In-Scope Open None
Other Operations 162850 CPU throttling on DELL PowerEdge R320 In-Scope Open None
Other Operations 111595 Do not apply spam headers on email assessed NOT to be spam In-Scope Open None
Other Operations 175288 setup/install/deploy deploy1001 as deployment server In-Scope Open None
Other Operations 162122 Swiftrepl was stuck in an infinite loop since days In-Scope Open None
Other Operations 178839 New upstream jvm-tools In-Scope Open None
Other Operations 189921 decom californium In-Scope Open None
Other Operations 148061 Feasibility of hosting podcast setup on Wikimedia servers In-Scope Open None
Other Operations 193915 rename wasat to mwmaint2001 and reinstall it with stretch Screep Open None
Other Operations 134326 udpmxircecho should write stats of messages processed and we should alert when that drops to zero In-Scope Open None
Other Operations 161864 404 error while accessing some images files e.g. djvu and jpg In-Scope Open None
Other Operations 191018 Provide an option menu when booting via PXE In-Scope Open None
Other Operations 160060 Icinga check for sysctl settings In-Scope Open None
Other Operations 192092 setup replacement for terbium (maintenance_server) on stretch Screep Open None
Other Operations 191438 Upgrade Puppet compilers to Stretch Screep Open None
Other Operations 135338 On Trusty and Jessie PHP yields: PHP Deprecated: Comments starting with '#' are deprecated in /etc/php5/cli/conf.d/20-xhprof.ini on line 2 In-Scope Open None
Other Operations 185236 Password Vault for Security Team In-Scope Open None
Other Operations 149885 Investigate Swift as a storage backend for maps tiles In-Scope Open None
Other Operations 178628 Improve puppet alerting In-Scope Open None
Other Operations 184230 Disavow emails from wikipedia.com In-Scope Open None
Other Operations 112774 solve mtp panel issue for row uplinks In-Scope Open None
Other Operations 195293 503 error attempting to open multiple projects (Wikipedia and meta wiki are loading very slowly) Screep Open None
Other Operations 146090 High failure rate of account creation should trigger an alarm / page people In-Scope Open None
Other Operations 153068 Consider mounting labs NFS labstore1003.eqiad.wmnet:/scratch for server-side uploads In-Scope Open None
Other Operations 176666 Qualtrics cannot send email to wikimedia.org addresses In-Scope Open None
Other Operations 137229 Tune thread for osm2pgsql / postgres max connections for Maps In-Scope Open None
Other Operations 139971 access_new_install role vs. Labs vs. the future In-Scope Open None
Other Operations 166937 Broken /a/refinery-source/guard/run_all_guards.sh script on stat1002 In-Scope Open None
Other Operations 191842 Deployment git server can't supply ORES hosts in parallel Screep Open None
Other Operations 151050 Proper documentation for Yubico 2FA for production use In-Scope Open None
Other Operations 182597 Use EtcdConfig in production to allow automation of a datacenter switch In-Scope Open None
Other Operations 193121 Upgrade ganeti hosts to stretch Screep Open None
Other Operations 179354 wikimedia-jessie & wikimedia-stretch docker images don't have deb-src set for apt.wikimedia.org In-Scope Open None
Other Operations 166066 Integrate the puppet compiler in the puppet CI pipeline In-Scope Open None
Other Operations 136603 Update limit.sh to support systemd-based cgroup management In-Scope Open None
Other Operations 103886 Translation cache exhaustion caused by changes to PHP code in file scope In-Scope Open None
Other Operations 185815 The Rack Puppet master server is deprecated and will be removed in a future release. Please use Puppet Server instead. In-Scope Open None
Other Operations 165323 Add Prometheus machine metric to track core dumps In-Scope Open None
Other Operations 163393 Determine appropriate proxy_read_timeout setting for Tools Proxy In-Scope Open None
Other Operations 133093 Investigate idle appservers in codfw In-Scope Open None
Other Operations 175213 2017/18 Annual Plan Program 8: Multi-datacenter support, Q2 goals In-Scope Open None
Other Operations 141038 implement icinga paging for non-ops teams In-Scope Open None
Other Operations 192948 Upgrade prometheus-jmx-exporter on all services using it Screep Open None
Other Operations 170456 FY2017/18 Program 6 - Outcome 2 - Objective 3: Integrated, container-based development environment In-Scope Open None
Other Operations 153940 Logrotate fails for: "$FILE No such file or directory" In-Scope Open None
Other Operations 188602 Decrease the amount of IRC spam in case of widespread puppet failures In-Scope Open None
Other Operations 162612 codfw/eqiad hosts occasionally spend > 3 minutes starting networking.service with linux 4.9 In-Scope Open None
Other Operations 193605 Alert when elasticsearch writes are frozen for too long Screep Open None
Other Operations 163698 Add flood protection to the ircecho bot (icinga-wm) In-Scope Open None
Other Operations 156232 confctl SubjectAltNameWarning after python-urllib3 upgrade In-Scope Open None
Other Operations 147366 Setup automated topk wide row reporting In-Scope Open None
Other Operations 95054 Move ircecho config file to be YAML In-Scope Open None
Other Operations 187101 Setup some alert mechanism when some 'critical' cron jobs fail In-Scope Open None
Other Operations 129963 Update memcached package and configuration options In-Scope Open None
Other Operations 113785 Make the Shinken IRC alert and icinga-wm bots use colors In-Scope Open None
Other Operations 150872 Replace OCG in collection extension with Electron In-Scope Open None
Other Operations 150823 Puppet CA rollover In-Scope Open None
Other Operations 141128 determine/process/document bios firmware tracking/updating policies In-Scope Open None
Other Operations 158915 Setup reply emails for gerrit In-Scope Open None
Other Operations 177826 Upgrade ci ssh key to ecdsa In-Scope Open None
Other Operations 184461 Discourse migration from wmflabs to production In-Scope Open None
Other Operations 182822 Generate a list of files that are supposed to exist but 404s In-Scope Open None
Other Operations 116742 Track amount of package updates on systems In-Scope Open None
Other Operations 187991 Have swift metrics available in Prometheus In-Scope Open None
Other Operations 179353 Scap: Standardize git version In-Scope Open None
Other Operations 153246 Puppet failures with "Attempt to assign to a reserved variable name: 'trusted'" In-Scope Open None
Other Operations 159830 Sanity check global-multiwrite logs for ConfirmEdit usage In-Scope Open None
Other Operations 184063 Remove all decommissioned hardware In-Scope Open None
Other Operations 169518 Decommission esams ms-fe / ms-be In-Scope Open None
Other Operations 140316 Add granularity limiter (g=) to wikimedia.org DKIM record(s) In-Scope Open None
Other Operations 170640 reports.frdev.wm.o -- still in use? In-Scope Open None
Other Operations 173721 Track down the source of periodic increases in requests to swift eqiad In-Scope Open None
Other Operations 155683 Rename (create anew) the TC team mailing list Screep Open None
Other Operations 164042 Racktables: clearly show when hosts are decommissioned In-Scope Open None
Other Operations 153416 docker-engine pulled into our repositories only keeps the latest version In-Scope Open None
Other Operations 174916 electron/pdfrender hangs In-Scope Open None
Other Operations 182034 Decommission osm-cp100[1-4] In-Scope Open None
Other Operations 109606 Re-evaluate Limesurvey In-Scope Open None
Other Operations 161096 confctl no longer logs a non-changing state change In-Scope Open None
Other Operations 142002 Clean up puppet & configs for ORES In-Scope Open None
Other Operations 124101 Specific revisions of multiple files missing from Swift - 404 Not Found returned In-Scope Open None
Other Operations 193272 Prometheus vs. CPU usage vs. hyperthreading Screep Open None
Other Operations 182759 Add Prometheus exporter to Jenkins instances In-Scope Open None
Other Operations 169290 New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS In-Scope Open None
Other Operations 181205 let quarry use the mariadb module In-Scope Open None
Other Operations 178445 flapping monitoring for recommendation_api on scb In-Scope Open None
Other Operations 147204 Update confd package In-Scope Open None
Other Operations 117673 labs precise and jessie instance not accessible after provisioning In-Scope Open None
Other Operations 151304 tmpreaper possible race condition In-Scope Open None
Other Operations 159536 Puppet constantly trying to stop the already stopped puppetmaster process on Trusty In-Scope Open None
Other Operations 147040 Two recently uploaded files have disappeared (404) In-Scope Open None
Other Operations 157038 Make it possible to run the mediawiki testsuite against a staging repo of apt.wikimedia.org In-Scope Open None
Other Operations 124991 evaluate possibility for nscd use with useldap In-Scope Open None
Other Operations 184086 Add prometheus exporter to Gerrit In-Scope Open None
Other Operations 140813 Protect sensitive user-related information with a UserData / auth / session service In-Scope Open None
Other Operations 148017 lvs2002 repeated usb connect/disconnect message In-Scope Open None
Other Operations 150396 Phabricator leaving old files in /tmp In-Scope Open None
Other Operations 182819 custom fact interface_primary breaks under newer versions of facter In-Scope Open None
Other Operations 193649 migrate elasticsearch to stretch (from jessie) Screep Open None
Other Operations 188377 Import some Analytics git puppet submodules to operations/puppet In-Scope Open None
Other Operations 76306 Set warning thresholds for average cluster utilization In-Scope Open None
Other Operations 67394 [EPIC] Performance testing environment In-Scope Open None
Other Operations 193112 Jobs writing to the Elasticsearch cluster in codfw are timing out, causing all type of issues Screep Open None
Other Operations 190693 Extend dpkg Icinga check to also check for inconsistent apt state In-Scope Open None
Other Operations 183454 Deprovision Diamond collectors no longer in use In-Scope Open None
Other Operations 175361 Upgrade mx1001/mx2001 to stretch In-Scope Open None
Other Operations 192370 Deploy mcrouter to production as a wancache backend Screep Open None
Other Operations 119719 Enforce a minimum refresh period for grafana dashboards hitting graphite In-Scope Open None
Other Operations 135991 Automated service restarts for common low-level system services In-Scope Open None
Other Operations 180330 Add CI to all operations/* repositories and archive obsolete ones In-Scope Open None
Other Operations 150486 Deploy federation for Prometheus In-Scope Open None
Other Operations 174465 Puppet admin module should support adding system users to managed groups In-Scope Open None
Other Operations 161528 incident 20170323-wikibase did not trigger Icinga paging In-Scope Open None
Other Operations 151047 Integrate Yubikey into data.yaml In-Scope Open None
Other Operations 175738 Long term storage for frack prometheus data In-Scope Open None
Other Operations 180944 Passenger spews Exception NoMethodError in Rack application object In-Scope Open None
Other Operations 88730 Nutcracker needs to automatically recover from MC failure - rebalancing issues In-Scope Open None
Other Operations 126158 [RFC] Alert about *when* partitions will run out of space, not a percentage/absolute number In-Scope Open None
Other Operations 133091 Highest SSTables / read thresholds In-Scope Open None
Other Operations 106346 setup an alertable threshold for Cassandra heap dumps In-Scope Open None
Other Operations 170480 FY2017/18 Program 6 - Outcome 2: Developers are able to develop and test their applications through a unified pipeline towards production deployment. In-Scope Open None
Other Operations 192102 deprecate and remove --autoload in uwsgi puppet class Screep Open None
Other Operations 109089 EPIC: Cultivating the Elasticsearch garden (operational lessons from 1.7.1 upgrade) In-Scope Open None
Other Operations 184064 Prepare racks OE14, OE15 and OE16 with new infrastructure In-Scope Open None
Other Operations 154665 Look into behaviour of /etc/exim4/update-exim4.conf.conf related to updates In-Scope Open None
Other Operations 160158 Make disabled accounts visible in the corp mirror LDAP replica In-Scope Open None
Other Operations 145065 Decrease time required to fully restart the Cirrus elasticsearch clusters In-Scope Open None
Other Operations 147923 Extract metrics from logs In-Scope Open None
Other Operations 191625 Create RC feed for login.wikimedia Screep Open None
Other Operations 123237 Provide production jessie image with node 4.2; use this for service-runner build command In-Scope Open None
Other Operations 189137 Migrate CirrusSearch jobs to Kafka queue In-Scope Open None
Other Operations 164819 reprepro: Support for buildinfo files / dbgsym packages In-Scope Open None
Other Operations 135113 Rationalize our jobqueues redis topology In-Scope Open None
Other Operations 152439 cronspam from labtestservices2001 /etc/dns-floating-ip-updater.py > /dev/null In-Scope Open None
Other Operations 134551 Create functional cluster checks for all services (and have them page!) In-Scope Open None
Other Operations 184065 Setup new access switches In-Scope Open None
Other Operations 186918 prometheus: ganglia-gen and outdated Ganglia:cluster resource name In-Scope Open None
Other Operations 32716 Run our own Tor client for Tor block In-Scope Open None
Other Operations 108985 Monitor MediaWiki sessions In-Scope Open None
Other Operations 143552 Make elasticsearch configuration more robust to loss of network connectivity In-Scope Open None
Other Operations 187474 Decommission old and unused/spare servers in codfw In-Scope Open None
Other Operations 151048 Icinga monitoring for Yubikey components In-Scope Open None
Other Operations 76203 Make ircecho run as its own user In-Scope Open None
Other Operations 170817 Upgrade Thumbor servers to Stretch In-Scope Open None
Other Operations 190318 remove puppet_major_version and puppetdb_major_version variables. clean up puppet master/db hieradata In-Scope Open None
Other Operations 169020 Decommission cp400[1-4] In-Scope Open None
Other Operations 135124 Deploy etcddump (or another etcd dump & load tool) to production In-Scope Open None
Other Operations 194176 wtp2020 correctable memory errors Screep Open None
Other Operations 190455 Logstash no longer captures DB queries in debug mode Screep Open None
Other Operations 168490 upgrade planet instances to stretch In-Scope Open None
Other Operations 176957 Decommission host copper.eqiad.wmnet In-Scope Open None
Other Operations 181763 Decommission niobium In-Scope Open None
Other Operations 184564 Plan Puppet 5 upgrade In-Scope Open None
Other Operations 164290 Set up external DNS record for wikitech-static In-Scope Open None
Other Operations 145885 Gerrit shows HTTP 500 error when pasting extended unicode characters In-Scope Open None
Other Operations 127054 pinentry-gtk2 pulls in a lot of unneeded Gnome/GTK libs In-Scope Open None
Other Operations 187445 Decommission osm-db200[12] and osm-web200[1234] In-Scope Open None
Other Operations 181630 Send celery and wsgi service logs to logstash In-Scope Open None
Other Operations 178663 Switch CI Docker Storage Driver to its own partition and to use devicemapper In-Scope Open None
Other Operations 161145 Fix the general problem of randomly-bad puppet agent cron timings within redundant clusters In-Scope Open None
Other Operations 186153 outdated DjVu file page thumbnail in cache In-Scope Open None
Other Operations 187364 rack frdata1001 In-Scope Open None
Other Operations 169035 bast3002 sdb broken In-Scope Open None
Other Operations 194997 Track more detailed disk usage on maps servers Screep Open None
Other Operations 182015 Decommission Vanadium In-Scope Open None
Other Operations 193196 labnet1003 and labnet1004 moving and enabling 10G NICs Screep Open None
Other Operations 135595 mod_deflate + mod_uwsgi causing mangled apache responses In-Scope Open None
Other Operations 156937 Provide cross-dc redundancy (active-active or active-passive) to all important misc services In-Scope Open None
Other Operations 187994 netfilter software at WMF: iptables vs nftables In-Scope Open None
Other Operations 189890 add ssh key comparison to cross-validate-accounts.py In-Scope Open None
Other Operations 147872 Rename rhodium to puppetmaster1003 In-Scope Open None
Other Operations 175710 Add profiling for Varnish and VCL In-Scope Open None
Other Operations 184186 Fix unknown variables warning that occur with puppet 4.x In-Scope Open None
Other Operations 148693 Deploy IDS rendering engine to production In-Scope Open None
Other Operations 186311 wikitech-l is mangling my PGP/MIME emails, causing signature validation to fail Screep Open None
Other Operations 118746 Goal: Strengthen Incident monitoring infrastructure In-Scope Open None
Other Operations 182812 Forward security@tools.wmflabs.org to security@wikimedia.org In-Scope Open None
Other Operations 116750 2FA for SSH access to the production cluster In-Scope Open None
Other Operations 125976 Run mediawiki::maintenance scripts in Beta Cluster Screep Open None
Other Operations 181559 Investigate redis-cluster or other techniques for making Redis not a single point of failure. In-Scope Open None
Other Operations 164993 archiva artifact links point to 127.0.0.1 In-Scope Open None
Other Operations 189435 Integrate stretch 9.4 point update In-Scope Open None
Other Operations 152767 Missing Labs hiera entry in labs-private repo In-Scope Open None
Other Operations 98984 Check power supply balance settings on cp3030+ In-Scope Open None
Other Operations 109090 Investigate the need for master only (non data nodes) in our ES cluster In-Scope Open None
Other Operations 104671 Rename 'restricted' group? In-Scope Open None
Other Operations 180498 planet.wikimedia.org: replace planet-venus software with rawdog In-Scope Open None
Other Operations 179696 Homepage for https://docker-registry.wikimedia.org In-Scope Open None
Other Operations 170453 FY2017/18 Program 6: Streamlined Service delivery In-Scope Open None
Other Operations 105780 Create a doc explaining the SLA between services and the monitoring tool In-Scope Open None
Other Operations 187456 Decommission labstore100[12] and their disk shelves In-Scope Open None
Other Operations 154915 Get rid of "import realm.pp" in manifests/site.pp In-Scope Open None
Other Operations 122127 Translation of namespaces for Gilaki In-Scope Open None
Other Operations 159687 etcd switchover/enhancements In-Scope Open None
Other Operations 192751 Please upload large file to Wikimedia Commons Screep Open None
Other Operations 149804 Review of ferm services without srange In-Scope Open None
Other Operations 194835 mw2182 crash Screep Open None
Other Operations 190766 add ci test for admin module indentation In-Scope Open None
Other Operations 41785 Create a labs SMTP smarthost In-Scope Open None
Other Operations 101585 document redis upgrade/restart procedures In-Scope Open None
Other Operations 161835 Convert labstore cluster configuration to hiera and profiles In-Scope Open None
Other Operations 163362 audit all codfw pdu tower draws In-Scope Open None
Other Operations 185215 Puppet compiler failure to lookup some keys In-Scope Open None
Other Operations 181988 Investigate and improve memory allocation rates of WDQS In-Scope Open None
Other Operations 150875 Confirm attribution needs In-Scope Open None
Other Operations 146657 create notifications about user accounts that have not been used for a long time In-Scope Open None
Other Operations 173097 Decommission stat1002.eqiad.wmnet In-Scope Open None
Other Operations 151049 Run systematic availability tests In-Scope Open None
Other Operations 133656 Have a paging check for Nova API accessible In-Scope Open None
Other Operations 148647 refresh swift hardware in codfw/eqiad In-Scope Open None
Other Operations 188913 "Obama" page on Beta Cluster often responds with 503 In-Scope Open None
Other Operations 132532 rsync module doesnt work on trusty In-Scope Open None
Other Operations 189729 Build .deb package of python3-typing for jessie In-Scope Open None
Other Operations 190085 Reclaim/Decommission Silver.wikimedia.org In-Scope Open None
Other Operations 148048 Store Wikimedia unified account name (SUL) in LDAP directory In-Scope Open None
Other Operations 169570 nfs-manage failover script needs to be tested with real load and fixed In-Scope Open None
Other Operations 191491 Adjust bandwidth/connection limits, memory settings on labstore1006,7 as appropriate Screep Open None
Other Operations 146968 OTRS spam classification methods and systems In-Scope Open None
Other Operations 163336 kube-proxy pulls in docker and starts service even when it isnt needed In-Scope Open None
Other Operations 137176 catch-all apache vhost on the cluster should return 404 for non-existing sites In-Scope Open None
Other Operations 132325 Weak digest algorithm (SHA1) used to sign InRelease on apt.wikimedia.org In-Scope Open None
Other Operations 161003 Cross-check disabled accounts from corp LDAP against data.yaml In-Scope Open None
Other Operations 160529 Sender email spoofing In-Scope Open None
Other Operations 160412 Add lock_wait_timeout to maintain_views and maintain-meta_p In-Scope Open None
Other Operations 165631 move gerrit.wm.org SSH service to private/behind LVS like phab-vcs In-Scope Open None
Other Operations 185189 scap sudo violation on first puppet run In-Scope Open None
Other Operations 123918 'swift' user/group IDs should be consistent across the fleet In-Scope Open None
Other Operations 191191 pdfrender logs to /var/log/syslog as well as to /srv/log/pdfrender Screep Open None
Other Operations 194174 wtp2013 memory correctable errors Screep Open None
Other Operations 180051 Reduce the number of fields declared in elasticsearch by logstash In-Scope Open None
Other Operations 106937 Monitor [[Special:ListFiles]] for non 200 HTTP statuses in thumbnails In-Scope Open None
Other Operations 185306 ms-be2023 unresponsive while rebuilding one disk In-Scope Open None
Other Operations 162955 rebuild tools-grid-master as a large instance In-Scope Open None
Other Operations 84279 Admin module should allow group management of system users In-Scope Open None
Other Operations 181967 Update puppet code to conform to puppet 4.x and later standards In-Scope Open None
Other Operations 169318 Use multiple puppetdbs on puppet masters In-Scope Open None
Other Operations 171188 Move the main WMCS puppetmaster into the Labs realm In-Scope Open None
Other Operations 133674 HHVM is leaking memory on the API appservers In-Scope Open None
Other Operations 133476 Proposal: Centralize OTRS login methodology In-Scope Open None
Other Operations 156136 Increase swift replication factor for accounts In-Scope Open None
Other Operations 126989 MediaWiki logging & encryption In-Scope Open None
Other Operations 155209 Increase $wgHTTPImportTimeout to a higher value on WMF wikis In-Scope Open None
Other Operations 126295 Spike: What do we have to package to run the Programs and Events dashboard on production? In-Scope Open None
Other Operations 167292 Collate jessie-wikimedia/backports into jessie-wikimedia/main In-Scope Open None
Other Operations 149643 Review Icinga alarms with disabled notifications In-Scope Open None
Other Operations 185275 replace tin (new hardware) In-Scope Open None
Other Operations 156143 High CPU usage from swift-proxy on frontend machines In-Scope Open None
Other Operations 151317 stat user crontab on stat hosts for old file removal In-Scope Open None
Other Operations 130593 investigate slapd memory leak In-Scope Open None
Other Operations 194184 rack/setup/install wdqs10[09|10].eqiad.wmnet Screep Open None
Other Operations 187736 Host deployment-puppetdb01 is DOWN: CRITICAL - Host Unreachable (10.68.23.76) In-Scope Open None
Other Operations 166038 Sync internal nutcracker package with Debian package In-Scope Open None
Other Operations 187473 Decommission old and unused/spare servers in eqiad In-Scope Open None
Other Operations 116805 DomainKeys Identified Mail (DKIM) for phabricator.wikimedia.org In-Scope Open None
Other Operations 184522 To purchase for next esams visit In-Scope Open None
Other Operations 157030 cannot delete non-empty directory: php-1.29.0-wmf.3 messages on 'scap sync' on mwdebug1002 In-Scope Open None
Other Operations 183954 templatetiger is using 827G of 8T available tools nfs storage In-Scope Open None
Other Operations 174269 Two cases of local-multiwrite storage backend failure In-Scope Open None
Other Operations 118812 Investigate mysterious_sysctl settings and figure out what to do with them In-Scope Open None
Other Operations 172815 Improve stability and maintainability of our browser-based PDF render service In-Scope Open None
Other Operations 182331 [Epic] Deploy ORES in kubernetes cluster In-Scope Open None
Other Operations 186288 replace all Ubuntu (trusty) hosts in production with Debian In-Scope Open None
Other Operations 181200 Use "Charter" as preferred typeface on Electron In-Scope Open None
Other Operations 149589 Puppet tab in Horizon unusably slow In-Scope Open None
Other Operations 117508 Make ops-l a list for humans again (no cheating) In-Scope Open None
Other Operations 192103 Decommission notebook1001 Screep Open None
Other Operations 159661 Improve Terbium (and wasat) userland to process server side uploads In-Scope Open None
Other Operations 169287 etcd config depends on puppet certs, but puppet doesn't know In-Scope Open None
Other Operations 192457 Reallocate former image scalers Screep Open None
Other Operations 157306 Fix config file handling for /etc/hhvm/php.ini In-Scope Open None
Other Operations 182497 Update log config for scb* boxes, to deal with ORES verbose logging In-Scope Open None
Other Operations 149543 Setup PAWS internal experimentally on notebook* nodes In-Scope Open None
Other Operations 180628 Install git-lfs client (at least on scap targets & masters) In-Scope Open None
Other Operations 176437 puppet ca_server confusion In-Scope Open None
Other Operations 184236 Puppet broken on deployment-ms-be0[34] with evaluation error in swift module In-Scope Open None
Other Operations 135125 Install a second etcd cluster in codfw In-Scope Open None
Other Operations 167689 Add RIPE atlas data to Prometheus In-Scope Open None
Other Operations 136094 Race condition in setting net.netfilter.nf_conntrack_tcp_timeout_time_wait In-Scope Open None
Other Operations 185004 Decommission mw1201-mw1220 In-Scope Open None
Other Operations 167377 Decommission cp4011, cp4012, cp4019, cp4020 In-Scope Open None
Other Operations 163288 Decide on /var/lib vs /home as locations of homedir for l10nupdate In-Scope Open None
Other Operations 151046 Fully puppetise yubikey-val In-Scope Open None
Other Operations 195059 Cannot add or update records under DNS zones in Horizon Screep Open None
Other Operations 163507 Intermittent DB connectivity problem on phabricator, needs investigation In-Scope Open None
Other Operations 174449 tin has a failing hdd In-Scope Open None
Other Operations 171482 Programmatic generation of grafana dashboards In-Scope Open None
Other Operations 164238 move icinga contacts file to public repo In-Scope Open None
Other Operations 161920 logrotate for ruthenium In-Scope Open None
Other Operations 132632 puppetize turning off reserved space for cassandra /srv In-Scope Open None
Other Operations 164490 maintain-meta_p hangs on connecting to wikimedia.org.uk In-Scope Open None
Other Operations 169286 labstore1005 A PCIe link training failure error on boot In-Scope Open None
Other Operations 135122 Reduce etcd technical debt In-Scope Open None
Other Operations 131326 smokeping config puppetization issue? In-Scope Open None
Other Operations 134458 status.wikimedia.org should use some Wikimedia favicon if possible In-Scope Open None
Other Operations 177371 Phase out DSA keys for SSH access (ssh-dss) In-Scope Open None
Other Operations 168403 Aggregate prometheus functions yielding different results in grafana vs. prometheus console In-Scope Open None
Other Operations 150771 Secondary production Jenkins for CI In-Scope Open None
Other Operations 187365 rack frpig1001 In-Scope Open None
Other Operations 184245 Create some mechanism for instances in projects to modify the project Designate records In-Scope Open None
Other Operations 138799 Create a simple puppet role for setting up a singlenode kubernetes install In-Scope Open None
Other Operations 176445 Systematically test load speeds of Watchlist and Recent Changes In-Scope Open None
Other Operations 130883 decom cp3011-22 (12 machines) In-Scope Open None
Other Operations 87220 Minimize differences between beta and production (Tracking) In-Scope Open None
Other Operations 111540 Clean up labs graphite datapoints In-Scope Open None
Other Operations 194249 kafka1023 correctable memory errors Screep Open None
Other Operations 187673 Build and deploy hhvm-luasandbox 3.0.1 to Wikimedia wikis In-Scope Open None
Other Operations 158757 Puppet certificate missing subjectAltName In-Scope Open None
Other Operations 179395 Cluster puppet variable and ganglia decommission In-Scope Open None
Other Operations 190111 VirtualHost for mod_status breaks debugging Apache/MediaWiki from localhost In-Scope Open None
Other Operations 158562 Manage apt sources via puppet? In-Scope Open None
Other Operations 191352 decom zinc/WMF3298 Elaborated Open None
Other Operations 175362 Split MXes into inbound and outbound In-Scope Open None
Other Operations 136562 Audit/fix hosts with no RAID configured In-Scope Open None
Other Operations 124185 Evaluate alternative web interfaces to icinga 1 core In-Scope Open None
Other Operations 187190 Decommission graphite1002 In-Scope Open None
Other Operations 177914 Switch labstore servers to default SSH configuration In-Scope Open None
Other Operations 161834 Undo special tools-home and tools-project share definitions for NFS In-Scope Open None
Other Operations 192532 Figure out a way to enable volunteers to use the puppet compiler Screep Open None
Other Operations 166368 Wipe of spare/replacement disks In-Scope Open None
Other Operations 82937 re-create script for manual paging In-Scope Open None
Other Operations 153816 apache::static_site is not working In-Scope Open None
Other Operations 175210 Select candidate jobs for transferring to the new infrastucture In-Scope Open None
Other Operations 118829 Automate the provisioning and management of MediaWiki clusters In-Scope Open None
Other Operations 188317 Detect high server load earlier – prometheus alert? In-Scope Open None
Other Operations 115757 document debian packaging guidelines In-Scope Open None
Other Operations 188601 Gain visibility into httpd mod_proxy actions In-Scope Open None
Other Operations 163996 Icinga check for ipv6 host reachability In-Scope Open None
Other Operations 193766 Ship host syslogs to ELK Screep Open None
Other Operations 138496 bring swift eqiad to one zone per row In-Scope Open None
Other Operations 118331 Alert when used_memory gets too high for redis queues In-Scope Open None
Other Operations 184655 logstash group1 dashboard incorrectly shows testwikidatawiki In-Scope Open None
Other Operations 113104 Set up a service IP for logstash In-Scope Open None
Other Operations 142815 Enhance account handling (meta bug) In-Scope Open None
Other Operations 179230 Puppet wmf-style-guide: array of classes not detected properly In-Scope Open None
Other Operations 194172 mw2213 correctable memory errors Screep Open None
Other Operations 152445 Move prometheus entry point off port 80 In-Scope Open None
Other Operations 168460 Update certificates on productions replicas of corp.wikimedia.org LDAP In-Scope Open None
Other Operations 125015 Requests to (hard) redirect pages return their target's contents but are counted as pageviews to the redirect page In-Scope Open None
Other Operations 186416 Allow selecting which images to build In-Scope Open None
Other Operations 161296 Upgrade mysqld_exporter to 0.10.0 In-Scope Open None
Other Operations 192561 Upgrade deployment-prep deployment servers to stretch Screep Open None
Other Operations 190717 Update wikidiff2 library on the WMF production cluster In-Scope Open None
Other Operations 174475 update firmware on scs consoles In-Scope Open None
Other Operations 120856 Remove all out of warranty unused cp10xx's from A2 In-Scope Open None
Other Operations 138866 Update & standardize Platform-specific_documentation for HP servers In-Scope Open None
Other Operations 84700 Setup management switch in OE12 In-Scope Open None
Other Operations 133844 Improve Elasticsearch icinga alerting In-Scope Open None
Other Operations 179099 puppetmaster hostcert and hostprivkey point to nonexistent files In-Scope Open None
Other Operations 151314 logrotate failing with $FILE.1.gz: File exists In-Scope Open None
Other Operations 154627 Production error message (when servers are down) points users to donate link which is likely to produce the same error message In-Scope Open None
Other Operations 130617 Collect metrics on pool counter usage In-Scope Open None
Other Operations 158022 make apt.wikimedia.org HA In-Scope Open None
Other Operations 107108 Flow notification links on mobile point to desktop In-Scope Open None
Other Operations 114849 Log lines on flourine overflow at 8092 bytes. In-Scope Open None
Other Operations 133913 Completely port l10nupdate to scap In-Scope Open None
Other Operations 194907 Degraded RAID on labvirt1019 Screep Open None
Other Operations 114337 Assign 3 more servers to video scaler duty In-Scope Open None
Other Operations 187467 Decommission mw2017 and mw2099 In-Scope Open None
Other Operations 158837 Consolidate performance website and related software In-Scope Open None
Other Operations 183920 2018-01-02: labstore Tools and Misc share very full In-Scope Open None
Other Operations 127797 document all puppet classes / defined types!? In-Scope Open None
Other Operations 190716 Deploying FileExporter and FileImporter Screep Open None
Other Operations 167376 Decommission cp300[3456] In-Scope Open None
Other Operations 181855 scap support for git-lfs In-Scope Open None
Other Operations 95053 ircecho should accept input via unix sockets In-Scope Open None
Other Operations 179192 Check analytics1037 power supply status In-Scope Open None
Other Operations 191627 Remove Cassandra 2.2.6 packages from jessie-wikimedia/thirdparty apt repo Screep Open None
Other Operations 167549 Create Icinga alert when OSM replication lags on maps In-Scope Open None
Other Operations 135318 Document how to handle 'inconsistent state within the internal storage backends' issues In-Scope Open None
Other Operations 167966 Look into feasibility of disabling sha-1 host keys on our ssh daemons In-Scope Open None
Other Operations 124179 Improve access to and control over incident and metrics monitoring infrastructure In-Scope Open None
Other Operations 161004 Remove disabled users from internal mailing lists In-Scope Open None
Other Operations 83729 Fix monitoring of poolcounter service In-Scope Open None
Other Operations 183832 Rename project mailing list for Africa Wikimedia Developers project Screep Open None
Other Operations 134271 Replace ircd-ratbox with something newer/maintained In-Scope Open None
Other Operations 142821 Synchronise groups defined in data.yaml to LDAP In-Scope Open None
Other Operations 116747 Meta task "Revamp user authentication" In-Scope Open None
Other Operations 87790 decom amslvs1-4 (dc work) In-Scope Open None
Other Operations 159242 Segmentation fault creating thumbnail In-Scope Open None
Other Operations 193473 Add HTTPS support to wdqs-internal service Screep Open None
Other Operations 165885 Create a cron to clean clientbucket every day or hour In-Scope Open None
Other Operations 185504 Netbox: add Icinga check for PostgreSQL In-Scope Open None
Other Operations 128590 Cassandra uses default ip address for outbound packets while bootstrapping In-Scope Open None
Other Operations 89808 wikitech instances list is blank In-Scope Open None
Other Operations 183236 After reimage Puppet order: sudo command failed In-Scope Open None
Other Operations 187651 Setting packages on 'hold' breaks puppet runs In-Scope Open None
Other Operations 182228 run-no-puppet leave puppet disabled on kill/crash In-Scope Open None
Other Operations 181632 Celery manager implodes horribly if Redis goes down In-Scope Open None
Other Operations 182033 Decommission osm-web100[1-4] In-Scope Open None
Other Operations 130709 authoritative copy of 'root' files for upload.wikimedia.org is only in swift In-Scope Open None
Other Operations 111838 Some files had disappeared from Commons after renaming In-Scope Open None
Other Operations 185814 /etc/puppet/hiera.yaml: Use of 'hiera.yaml' version 3 is deprecated. It should be converted to version 5 In-Scope Open None
Other Operations 187076 Deploy error: insufficient permission for adding an object to repository database .git/objects In-Scope Open None
Other Operations 161598 Monitor HHVM bytecode cache depletion on mediawiki app servers In-Scope Open None
Other Operations 146627 Make deployment-prep puppetmaster more similar to Production puppetmaster In-Scope Open None
Other Operations 190225 Decommission unused host wmf3565 In-Scope Open None
Other Operations 94329 secure Cassandra/RESTBase cluster In-Scope Open None
Other Operations 193160 Monitor the BIOS boot order and parameters Screep Open None
Other Operations 180023 [DRAFT][RfC] Deployment of python applications in production In-Scope Open None
Other Operations 140442 reinstall rdb100[56] with RAID In-Scope Open None
Other Operations 194855 Degraded RAID on labvirt1020 Screep Open None
Other Operations 184924 Utilize the deployment pipeline (stretch) In-Scope Open None
Other Operations 182249 Diagnose and fix 4.5k req/min ceiling for ores* requests In-Scope Open None
Other Operations 141520 "MediaWiki exceptions and fatals per minute" alarm is too slow (half an hour delay!) In-Scope Open None
Other Operations 173492 Tune Varnishkafka delivery errors to be more sensitive In-Scope Open None
Other Operations 168967 Upload shiny-server .deb to our Jessie apt repository In-Scope Open None
Other Operations 125442 es2009 degraded RAID In-Scope Open None
Other Operations 150460 Configure maps cluster to send statsd metrics to the statsd endpoint in the same datacenter In-Scope Open None
Other Operations 178690 Better organization for ops grafana dashboards In-Scope Open None
Other Operations 138821 extend existing graphite whisper files retention to five years In-Scope Open None
Other Operations 148614 Icinga check for Tor In-Scope Open None
Other Operations 67270 Default license for operations/puppet In-Scope Open None
Other Operations 179501 Use external dsh group to list pooled ORES nodes In-Scope Open None
Other Operations 179856 Improve documentation for mirrors.wikimedia.org In-Scope Open None
Other Operations 149057 Designate seems very slow to delete records? In-Scope Open None
Other Operations 134875 udpmxircecho spam/not working if unable to connect to irc server In-Scope Open None
Other Operations 94215 decommission cp3001 & cp3002 In-Scope Open None
Other Operations 85451 scale graphite deployment (tracking) In-Scope Open None
Other Operations 181634 Investigate overload condition, seems that we lose nodes In-Scope Open None
Other Operations 46016 SVG fails to render properly due to several issues In-Scope Open None
Other Operations 188098 Add Prometheus collector for Tor In-Scope Open None
Other Operations 174959 swift-recon-cron on ms-be203[34]: [Errno 17] File exists: '/var/lock/swift-recon-object-cron' In-Scope Open None
Other Operations 170481 FY2017/18 Program 6 - Outcome 2 - Objective 2: Set up a continuous integration and deployment pipeline In-Scope Open None
Other Operations 193414 Servers using tidy-html5 are rendering pages differently, especially with <bdi> Screep Open None
Other Operations 166081 rack/setup/install conf1004-conf1006 In-Scope Open None
Other Operations 126083 overhaul labstore setup [tracking] In-Scope Open None
Other Operations 181621 What is causing ORES celery workers to suddenly require more CPU? In-Scope Open None
Other Operations 121240 Network isolation for production and semi-production services In-Scope Open None
Other Operations 169680 NFS on dataset1001 overloaded, high load on the hosts that mount it In-Scope Open None
Other Operations 191388 Puppet: tracking catalogs that changes at every run Screep Open None
Other Operations 191153 decom bast1001 In-Scope Open None
Other Operations 194234 anaytics1032's BBU is not working correctly Screep Open None
Other Operations 187984 Update OTRS to the latest stable version (6.x.x) In-Scope Open None
Other Operations 140141 Install mscorefonts on scaling servers for SVG rendering In-Scope Open None
Other Operations 195289 Add Addshore & possibly other WMDE devs/deployers to the wikidata icinga contact list Screep Open None
Other Operations 170152 mc2023 / mc2025 fail to mount root partition within 90 seconds using Linux 4.9 In-Scope Open None
Other Operations 128615 Get rid of Tool Labs home page check from shinken In-Scope Open None
Other Operations 82350 update exim::listserve::private::mailing_lists value in puppet In-Scope Open None
Other Operations 114801 operations-apache-config-lint replacement doesn't check syntax In-Scope Open None
Other Operations 131966 Default gateway unreachable on baham.wikimedia.org after reboot In-Scope Open None
Other Operations 163033 Create grafana dashboard for video scaler job runners In-Scope Open None
Other Operations 150466 publish kartotherian / tilerator metrics by cluster In-Scope Open None
Other Operations 168407 rack/setup/install labnodepool1002.eqiad.wmnet In-Scope Open None
Other Operations 136311 Monitor the BMC's event log for hardware errors In-Scope Open None
Other Operations 169884 Jobrunners generate mediawiki exceptions upon calling Closure$RecentChange::save In-Scope Open None
Other Operations 193408 SPF record for canonical domains Screep Open None
Other Operations 141756 audit / test / upgrade hp smartarray P840 firmware In-Scope Open None
Other Operations 178810 Wikibase: Increase batch size for HTMLCacheUpdateJobs triggered by repo changes. In-Scope Open None
Other Operations 186748 New service request: chromium-render/deploy In-Scope Open 5.0
Other Operations 160941 Improve SSH access information in onboarding documentation In-Scope Open None
Other Operations 181971 Disable hiera autolookups In-Scope Open None
Other Operations 120377 labmon1001 graphite instance archiver keeps archiving the same instances In-Scope Open None
Other Operations 153279 labnet/ labtestnet2001 - disk space - nova-api.log needs rotation In-Scope Open None
Other Operations 141783 Add monitoring for detecting when logstash services are down In-Scope Open None
Other Operations 101141 udp rcvbuferrors and inerrors on graphite1001 In-Scope Open None
Other Operations 92471 enable authenticated access to Cassandra JMX In-Scope Open None
Other Operations 116951 Reprepro should bail if it can't read and sign using the root keys In-Scope Open None
Other Operations 45952 Incorrect "non-identical file already exists" error when undeleting file on Commons In-Scope Open None
Other Operations 140075 investigate swift used space spikes since June 2016 In-Scope Open None
Other Operations 191956 Document how to fix IPMI issues on Wikitech Screep Open None
Other Operations 166291 Exim panics when spamd reaches maxchildren In-Scope Open None
Other Operations 193916 rename naos to deploy2001 and reinstall with stretch Elaborated Open None
Other Operations 162123 Running swiftrepl is not puppetized In-Scope Open None
Other Operations 125411 Diamond load averages do not contain scaled versions In-Scope Open None
Other Operations 140879 503 error raises again while trying to load a Wikidata page In-Scope Open None
Other Operations 193025 Decommision poolcounter1002 Screep Open None
Other Operations 163402 Ensure we can survive a loss of labservices1001 In-Scope Open None
Other Operations 100777 expose hosts in maintenance state so we can prevent scap from running on them In-Scope Open None
Other Operations 189566 Decommission eventlog1001 In-Scope Open None
Other Operations 184634 Netbox: postgres cannot be restarted w/ current config In-Scope Open None
Other Operations 155929 Create /community-beacon alternative entry point In-Scope Open None
Other Operations 133164 Document eqiad/codfw transition plan for OCG In-Scope Open None
Other Operations 179463 Create a single application to provision and manage developer (LDAP) accounts In-Scope Open None
Other Operations 132104 Consider moving policy.wikimedia.org away from WordPress.com In-Scope Open None
Other Operations 160101 Upgrade php5-json .deb to at least 1.3.8 In-Scope Open None
Other Operations 185024 Readd complete URL parsing fix from 3.18.7 release In-Scope Open None
Other Operations 189741 Build .deb package of python3-aiokafka In-Scope Open None
Other Operations 106664 Set up role accounts and feedback loops (FBL) with all providers In-Scope Open None
Other Operations 160644 Eventstreams graphite disk usage In-Scope Open None
Other Operations 179562 Create jenkins job for creating deployment artifacts for `docker-pkg-deploy` In-Scope Open None
Other Operations 111934 Nutcracker stats monitoring should only listen on localhost In-Scope Open None
Other Operations 91404 Setup backups of elasticsearch indices In-Scope Open None
Other Operations 156475 Investigate spike in 500s during asw-c2-eqiad replacement In-Scope Open None
Other Operations 171122 librenms: consider using Distributed Poller with multiple netmon servers In-Scope Open None
Other Operations 93531 secure.wikimedia.org entries still showing up in Google search results In-Scope Open None
Other Operations 104352 Make scap able to depool/repool servers via the conftool API In-Scope Open None
Other Operations 118677 Nastaleeq font for Western Punjabi In-Scope Open None
Other Operations 123809 Module uwsgi doesn't allow passing multiple config params of same name In-Scope Open None
Other Operations 165511 Change automatic shortlink in blog theme In-Scope Open None
Other Operations 160146 jobrunner/jobchron services fail in codfw In-Scope Open None
Other Operations 158288 Unclean stop of jobrunner service via puppet In-Scope Open None
Other Operations 132324 Tracking and Reducing cron-spam to root@ In-Scope Open None
Other Operations 187078 Re-consider ` >/dev/null 2>&1` as output of many cron'd MW maintenance scripts In-Scope Open None
Other Operations 95742 Decomission amssq31-62 (32 hosts) In-Scope Open None
Other Operations 182203 Tuning profile::ores::celery parameters should cause a Celery service restart In-Scope Open None
Other Operations 150822 Internal PKI for secure communication - Barcelona Ops offsite 2016 In-Scope Open None
Other Operations 160229 Back up of Commons files In-Scope Open None
Other Operations 163823 During labservices1001 failover fqdn changed from foo.project.eqiad.wmflabs to foo.eqiad.wmflabs In-Scope Open None
Other Operations 174172 unused grafana-dashboard indices on elasticsearch / logstash In-Scope Open None
Other Operations 167245 prometheus-node-exporter - invalid group: ‘prometheus:prometheus' In-Scope Open None
Other Operations 191348 decommission uranium/WMF3128 Elaborated Open None
Other Operations 136403 Move cp3030+ from OE14 to OE13 in racktables In-Scope Open None