TechnicalOperations Status Report

Project Operations from 2017-07-01 to 2017-08-12

Help

Network Operations 171714 "MySQL server has gone away" from librenms logs Screep Done None
Network Operations 83196 internal network packet loss alerting Screep Done None
Network Operations 172478 git-ssh.wikimedia.org and IPv6 are broken after switch to phab1001 Screep Done None
Network Operations 148506 Rack and setup new eqiad row D switch stack (EX4300/QFX5100) In-Scope Done None
Network Operations 169624 deploy diffscan2 Screep Done None
Network Operations 167321 codfw: labtestpuppetmaster2001 switch port configuration In-Scope Done None
Network Operations 170380 codfw row C switch upgrade Screep Done None
Network Operations 171167 Evaluate LibreNMS' Graphite backend Screep Done None
Network Operations 169345 codfw row B switch upgrade In-Scope Done None
Network Operations 170369 Remove unsecure SSH algorithms on network devices Screep Done None
Network Operations 171970 Move codfw frack to new infra Screep Open None
Network Operations 82038 create a test for multicast relay In-Scope Open None
Network Operations 167840 Merge AS14907 with AS43281 In-Scope Open None
Network Operations 163674 Frequent RST returned by appservers to LVS hosts In-Scope Open None
Network Operations 171032 Investigate lvs IP pages during codfw row C switch upgrade Screep Open None
Network Operations 133387 Enabling IGMP snooping on QFX switches breaks IPv6 (HTCP purges flood across codfw) In-Scope Open None
Network Operations 150264 Icinga check for VRRP In-Scope Open None
Network Operations 170144 Evaluate NetBox as a Racktables replacement & IPAM Screep Open None
Network Operations 150256 Re-setup lvs1007-lvs1012, replace lvs1001-lvs1006 In-Scope Open None
Network Operations 157435 Review ACLs for the Analytics VLAN In-Scope Open None
Network Operations 98006 Anycast (Auth)DNS In-Scope Open None
Network Operations 120425 dumps.wikimedia.org seems to have poor throughput towards some destinations In-Scope Open None
Network Operations 86541 setup wifi in codfw In-Scope Open None
Network Operations 165584 Deploy pybal with BGP MED support (for primary/backup) in production In-Scope Open None
Network Operations 167306 ospf link-protection In-Scope Open None
Network Operations 171823 Grafana dashboards for librenms graphite data Screep Open None
Network Operations 169644 eqiad: rack frack refresh equipment Screep Open None
Network Operations 169643 codfw: rack frack refresh equipment Screep Open None
Network Operations 83992 Juniper monitoring In-Scope Open None
Network Operations 172459 eqiad row D switch upgrade Screep Open None
Network Operations 146391 eeden ethernet outage In-Scope Open None
Network Operations 167299 Upgrade BIOS/RBSU/etc on lvs1007 In-Scope Open None
Network Operations 122406 Consider renumbering Labs to separate address spaces In-Scope Open None
Network Operations 166171 rack/setup/wire/deploy msw2-c1-eqiad In-Scope Open None
Traffic 151643 python-varnishapi daemons seeing "Log overrun" constantly In-Scope Done None
Traffic 70528 stream.wikimedia.org - redirect http(s) to docs In-Scope Done None
Traffic 170843 Determine where to host zim files for the Android app Screep Done None
Traffic 172101 OCSP update failed for /etc/update-ocsp.d/globalsign-2016-ecdsa-unified.conf Screep Done None
Traffic 82747 pybal health checks are ipv4 even for ipv6 vips In-Scope Done None
Traffic 161101 Performance impact evaluation of enabling nginx-lua and nginx-lua-prometheus on tlsproxy In-Scope Done None
Traffic 160616 Enable HTTPS for swift clients In-Scope Done None
Traffic 170295 remove benefactorevents.wikimedia.org cname from DNS Screep Done None
Traffic 170193 revoke eventdonations.wikimedia.org SSL cert if there is one... Screep Done None
Traffic 132521 Enforce HTTPS+HSTS on remaining one-off sites in wikimedia.org that don't use standard cache cluster termination In-Scope Done None
Traffic 170192 remove eventdonations.wikimedia.org CNAME Screep Done None
Traffic 170140 revoke benefactorevents.wikimedia.org SSL certificate Screep Done None
Traffic 157353 prometheus-vhtcpd-stats cronspamming if vhtcpd is not running yet In-Scope Done None
Traffic 165736 Update Varnishkafka to support TLS encryption/authentication In-Scope Done None
Traffic 172417 Create wikimedia.org/resources redirect for Wikimedia Resource Center Screep Done None
Traffic 104225 enwiki Main_Page timeouts In-Scope Done None
Traffic 169893 pybal should reset the etcdindex it's looking at after losing a connection Screep Done None
Traffic 154759 Pybal not happy with DNS delays In-Scope Done None
Traffic 137161 Fix nits in HTTPS/HSTS configs in externally-hosted fundraising domains In-Scope Done None
Traffic 154227 URLs with title query string parameter and additional query string parameters do not redirect to mobile site Screep Done 3.0
Traffic 164579 Investigate nginx reload behavior In-Scope Done None
Traffic 171318 logster should not resolve statsd's IP every time it sends a metric Screep Done None
Traffic 168919 stream.wikimedia.org: remove legacy rcstream/socket.io HTTPS redirect hole punches In-Scope Done None
Traffic 171145 cp3048 down, mgmt console not reachable Screep Done None
Traffic 168013 Remove disableImages handling from VCL In-Scope Done None
Traffic 166695 Substantive HTTP and mediawiki/database traffic coming from a single ip In-Scope Open None
Traffic 101525 Set up LVS for current AuthDNS In-Scope Open None
Traffic 165765 Refactor pybal/LVS config for shared failover In-Scope Open None
Traffic 102178 Fix RESTBase support for wikitech.wikimedia.org In-Scope Open None
Traffic 102367 Migrate tools.wmflabs.org to https only (and set HSTS) In-Scope Open None
Traffic 102848 Split GeoIP into a new component In-Scope Open None
Traffic 165560 Artificial spike in offset of unique devices from November to February 6th on wikidata In-Scope Open None
Traffic 164868 SSL error for https://wikispecies.org/ In-Scope Open None
Traffic 104442 Investigate better DNS cache/lookup solutions In-Scope Open None
Traffic 164768 Explicitly limit varnishd transient storage In-Scope Open None
Traffic 104681 HTTPS Plans (tracking / high-level info) In-Scope Open None
Traffic 164609 Merge cache_misc into cache_text functionally In-Scope Open None
Traffic 164456 Build nginx without image filter support In-Scope Open None
Traffic 105657 Expires header for load.php should be relative to request time instead of cache time In-Scope Open None
Traffic 164327 replace ulsfo aging servers In-Scope Open None
Traffic 164259 Add VSL error counters to Varnishkafka stats In-Scope Open None
Traffic 106517 upload.wikimedia.org returns HTTP status code 503 for truncated urls, not 404 In-Scope Open None
Traffic 163541 cache hosts should auto-repool iff OCSP files are sane In-Scope Open None
Traffic 163251 Communicate dropping IE8-on-XP support (a security change) to affected editors and other community members In-Scope Open None
Traffic 107236 Switch port 80 to nginx on primary clusters In-Scope Open None
Traffic 163233 Implement Varnish-level rough ratelimiting In-Scope Open None
Traffic 163141 dbtree: make wasat a working backend and become active-active In-Scope Open None
Traffic 162818 icinga alerts on nodejs services when a recdns server is depooled In-Scope Open None
Traffic 108580 HTTPS for internal service traffic In-Scope Open None
Traffic 162683 Network hardware purchasing for Asia Cache DC In-Scope Open None
Traffic 162362 Make maps active / active In-Scope Open None
Traffic 162099 lvs2002 random shut down In-Scope Open None
Traffic 161517 Allow anonymous users to change interface language on Commons with ULS In-Scope Open None
Traffic 109325 Outbound HTTPS for varnish backend instances In-Scope Open None
Traffic 109331 Deleted files sometimes remain visible to non-privileged users if permanently linked In-Scope Open None
Traffic 161360 404 loading images from Virgin Media In-Scope Open None
Traffic 109776 Tilerator should purge Varnish cache In-Scope Open None
Traffic 161256 multi-component wmflabs.org subdomains doesn't work under simple wildcard TLS cert In-Scope Open None
Traffic 159429 Allow setting varnish connection timeouts in puppet In-Scope Open None
Traffic 111588 RFC: API-driven web front-end In-Scope Open None
Traffic 159412 Convert all of our site.pp/roles to the role/profile paradigm In-Scope Open None
Traffic 159411 Uniform cluster nomenclature across puppet In-Scope Open None
Traffic 159346 convert mail servers from GS to LE certificates In-Scope Open None
Traffic 159137 certspotter: Error retrieving STH from log In-Scope Open None
Traffic 159056 cp2017 froze and stopped serving traffic In-Scope Open None
Traffic 112316 Configure varnish to use "Unconfigured domain" page for 404 Not Served (instead of generic error) In-Scope Open None
Traffic 158599 Samsung Internet's desktop mode getting redirected to mobile site In-Scope Open None
Traffic 112765 Phabricator needs to expose notification daemon (websocket) In-Scope Open None
Traffic 156320 $wgServer with initial https:// does not force HTTPS (wgSecureLogin) In-Scope Open None
Traffic 156032 Server hardware installation for Asia Cache DC In-Scope Open None
Traffic 156030 Select site vendor for Asia Cache Datacenter In-Scope Open None
Traffic 156028 Name Asia Cache DC site In-Scope Open None
Traffic 114104 pybal doesn't fully manage LVS table leaving stale services (on IP change) In-Scope Open None
Traffic 155806 Add CAA records to our domains In-Scope Open None
Traffic 155314 Varnish does not cache Action API responses when logged in In-Scope Open None
Traffic 154954 Purge Varnish cache when a banner is saved In-Scope Open 2.0
Traffic 154801 Investigate varnishd child crashes when multiple nodes get depooled/pooled concurrently In-Scope Open None
Traffic 154702 Fix broken referer categorization for visits from Safari browsers In-Scope Open None
Traffic 154017 compile number of http uses for http://www.wikidata.org/entity In-Scope Open None
Traffic 153563 Consider switching to HTTPS for Wikidata query service links In-Scope Open None
Traffic 153468 Ferm/DNS library weirdness on deployment-mediawiki boxes In-Scope Open None
Traffic 152882 Many misc wikis lack mobile domains In-Scope Open None
Traffic 152622 Wikipedia.cz and other domains owned by WMCZ have invalid certificate In-Scope Open None
Traffic 152091 Block hotlinking In-Scope Open None
Traffic 150673 Thumb API: Varnish / CDN questions In-Scope Open None
Traffic 150479 Prometheus varnish metric churn due to VCL reloads In-Scope Open None
Traffic 117435 Spike: CentralNotice: Verify that our Special:HideBanners cookie storm works as efficiently as possible In-Scope Open 2.0
Traffic 150022 thumb_handler.php should not set CC:no-cache on renderer 404 responses? In-Scope Open None
Traffic 117826 TEST: redirect small portion of unauthenticated desktop users to mobile web In-Scope Open None
Traffic 149873 CentralNotice: Review and update Varnish caching for Special:BannerLoader In-Scope Open 2.0
Traffic 118181 Planning for phasing out non-Forward-Secret TLS ciphers In-Scope Open None
Traffic 149847 Use content hash based image / thumb URLs In-Scope Open None
Traffic 148983 cp3021 failed disk sdb In-Scope Open None
Traffic 118468 point wikilovesmonuments.org ns to wmf In-Scope Open None
Traffic 148976 Strongswan Icinga check: do not report issues about depooled hosts In-Scope Open None
Traffic 148422 cp3009: memory scrubbing error In-Scope Open None
Traffic 119038 Image cache issue when 'over-writing' an image on commons In-Scope Open None
Traffic 148134 OCSP Stapling for Intermediates In-Scope Open None
Traffic 148131 Deploy redundant unified certs In-Scope Open None
Traffic 119366 Disable caching on the main page for anonymous users In-Scope Open None
Traffic 119372 Pybal IdleConnectionMonitor with TCP KeepAlive shows random fails if more than 100 servers are involved. In-Scope Open None
Traffic 119396 Create globally-unique varnish cache cluster port/instancename mappings In-Scope Open None
Traffic 147202 Removing support for AES128-SHA TLS cipher In-Scope Open None
Traffic 147199 Removing support for DES-CBC3-SHA TLS cipher (drops IE8-on-XP support) In-Scope Open None
Traffic 146832 Clarify caching to enable direct Wikidata Query Service access by <mapframe/link> In-Scope Open None
Traffic 146619 DNS domains registered to WMF no longer redirecting In-Scope Open None
Traffic 146332 Create short link for outreachdashboard.wmflabs.org In-Scope Open None
Traffic 145661 varnish backends start returning 503s after ~6 days uptime In-Scope Open None
Traffic 144626 Strong cipher preference ordering for cache terminators In-Scope Open None
Traffic 144508 Point wikipedia.in to 205.147.101.160 instead of URL forward In-Scope Open None
Traffic 120121 Improve Varnish XFF processing for trusted proxies In-Scope Open None
Traffic 144194 Varnish-triggered CN campaign about browser security In-Scope Open None
Traffic 144187 Better handling for one-hit-wonder objects In-Scope Open None
Traffic 143562 High number of failed inbound TFO connections in esams Mon-Fri In-Scope Open None
Traffic 120486 add a https-only option to dynamicproxy In-Scope Open None
Traffic 120509 Cache education dashboard pages In-Scope Open None
Traffic 141480 mixed-content issues on planet.wikimedia.org In-Scope Open None
Traffic 141373 Age header reset to 0 after 24 hours on varnish frontends In-Scope Open None
Traffic 120631 Security: Is it safe to enable Zero spoofing In-Scope Open None
Traffic 141266 letsencrypt puppetization: add parallel rsa+ecdsa cert support In-Scope Open None
Traffic 138546 Backend naming in VCL needs to use fqdn+port In-Scope Open None
Traffic 138093 Investigate query parameter normalization for MW/services In-Scope Open None
Traffic 137990 Zero: Investigate removing the limit on carrier tagging to m-dot and zero-dot requests In-Scope Open None
Traffic 137979 Support brotli compression In-Scope Open None
Traffic 121561 Encrypt Kafka traffic, and restrict access via ACLs In-Scope Open 0.0
Traffic 137252 Redirect phabricator.mediawiki.org to phabricator.wikimedia.org In-Scope Open None
Traffic 136737 Fix lvs1001-6 storage In-Scope Open None
Traffic 135762 A/B Testing solid framework In-Scope Open None
Traffic 134893 Unhandled pybal error causing services to be depooled in etcd but not in lvs In-Scope Open None
Traffic 134447 letsencrypt puppetization: upgrade for scalability In-Scope Open None
Traffic 122867 Evaluate the feasibility of cache invalidation for the action API In-Scope Open None
Traffic 134404 Varnish support for active:active backend services In-Scope Open None
Traffic 133895 Varnish configuration for mobile domains should be coherent with Apache configuration In-Scope Open None
Traffic 133791 check_dns needs to be rewritten In-Scope Open None
Traffic 133717 Letsencrypt all the prod things we can - planning In-Scope Open None
Traffic 133548 Create a secure redirect service for large count of non-canonical / junk domains In-Scope Open None
Traffic 133410 Deploy TemplateStyles to WMF production In-Scope Open None
Traffic 133149 Move californium to an internal host? In-Scope Open None
Traffic 123854 Set up action API latency / error rate metrics & alerts In-Scope Open None
Traffic 133001 Decom legacy ex-parsoidcache cxserver, citoid, and restbase service hostnames In-Scope Open 0.0
Traffic 132629 Data passed to HHVM ($_SERVER variables) is a mixed bag of already-decoded and non-decoded nonsense In-Scope Open None
Traffic 125938 PHP fatal errors causing Varnish to return 503 - "Junk after gzip data" In-Scope Open None
Traffic 127387 Split slash decoding from general percent normalization in Varnish VCL In-Scope Open None
Traffic 23027 Requests with utf-8 in the URL return a outdated page revision In-Scope Open None
Traffic 172418 Get translations for "IE8 on XP won't work" Screep Open None
Traffic 36670 Check all wikis for inclusions of http resources on https In-Scope Open None
Traffic 172198 setup/install cp402[5-8].ulsfo.wmnet Screep Open None
Traffic 45250 Redo /beacon/impression system (formerly Special:RecordImpression) to remove extra round trips on all FR impressions (title was: S:RI should pyroperish) In-Scope Open None
Traffic 172148 Determine URL paths for Zim files Screep Open None
Traffic 172124 PyBal Feature: progressive depooling strategy for monitored failures Screep Open None
Traffic 172123 Determine how to upload Zim files to Swift infrastructure Screep Open None
Traffic 172116 Improve OCSP fetching and monitoring strategies Screep Open None
Traffic 54253 Protocol-relative URLs are poorly supported or unsupported by a number of HTTP clients In-Scope Open None
Traffic 172103 IPVS issues with UDP services, pybal depooling strategy Screep Open None
Traffic 171967 setup/install cp4022 Screep Open None
Traffic 56783 Respect X-Forwarded-For only from trustworthy sources In-Scope Open None
Traffic 63782 Add varnish logs to logstash In-Scope Open None
Traffic 171966 setup/install cp402[34] Screep Open None
Traffic 66214 Define an official thumb API In-Scope Open None
Traffic 171850 Backport ipvsadm Screep Open None
Traffic 171710 pybal: add prometheus metrics Screep Open None
Traffic 74186 Varnish: Mobile site redirect interferes with OAuth authorization process In-Scope Open None
Traffic 75944 Monitor Varnish caches on beta cluster have two varnishd process running In-Scope Open None
Traffic 171498 Implement machine-local forwarding DNS caches Screep Open None
Traffic 171470 Monitor DNS delegations Screep Open None
Traffic 78421 m.{project}.org portal/redirect consistency In-Scope Open None
Traffic 78963 Support ESI for ResourceLoader In-Scope Open None
Traffic 79730 Add pybal check to ensure service IP is bound In-Scope Open None
Traffic 81305 Make PyBal respect advertised BGP capabilities In-Scope Open None
Traffic 82849 lvs servers report 'Memory allocation problem' on bootup In-Scope Open None
Traffic 83467 LVS testing needs to include internal services testing In-Scope Open None
Traffic 171168 cp1050 apparently stuck while "Initializing firmware interfaces..." Screep Open None
Traffic 84543 more robust certificate chain creation in puppet In-Scope Open None
Traffic 171028 Degraded RAID on cp1008 Screep Open None
Traffic 170847 Icinga check for pybal HTTP connections to etcd Screep Open None
Traffic 170605 ERR_RESPONSE_HEADERS_MULTIPLE_CONTENT_DISPOSITION Screep Open None
Traffic 86915 nan and minnan subdomain redirects are a mess In-Scope Open None
Traffic 170567 Support TLSv1.3 Screep Open None
Traffic 170518 Non zero rated LVS IPs Screep Open None
Traffic 88861 wikipedia.lol In-Scope Open None
Traffic 89838 Move proxy IP lists to META for Varnish XFF decoding In-Scope Open None
Traffic 91372 $wgMFAnonymousEditing = true is sometimes not respected: cache? In-Scope Open None
Traffic 91820 Create HTTP verb and sticky cookie DC routing in VCL In-Scope Open None
Traffic 94125 Central login notice appears on unencrypted API format=*fm pages, where reloading does not affect login status In-Scope Open None
Traffic 169600 Enable diamond PowerDNSRecursor collector on dnsrecursors Screep Open None
Traffic 169175 What is a reasonable per-IP ratelimit for maps Screep Open 2.0
Traffic 168699 Verify that the codfw lvs is configured correctly for Phabricator In-Scope Open None
Traffic 168529 Upgrade to Varnish 5 In-Scope Open None
Traffic 167906 Make API usage limits easier to understand, implement, and more adaptive to varying request costs / concurrency limiting In-Scope Open None
Traffic 167513 Add an redirect lzh.wikipedia to zh-classical.wikipedia In-Scope Open None
Traffic 96499 dbtree loads third party resources (from jquery.com and google.com) In-Scope Open None
Traffic 96844 Update TLS/HTTP documentation on wikitech In-Scope Open None
Traffic 97051 adding new languages to DNS langs.tmpl doesn't work until zone template is edited as well In-Scope Open None
Traffic 167060 en.wiki domain owned by us, but isn't hosted by us?? In-Scope Open None
Traffic 166965 Degraded RAID on lvs3001 In-Scope Open None
Traffic 98165 Figure out an etcd deploy strategy that includes multi DC failure scenarios. In-Scope Open None
Traffic 166782 wikimediafoundation.org's language selector is confusing to most visitors who don't have accounts there In-Scope Open None
Traffic 166758 cp3032 ethernet link down (bnx2x dump in the dmesg) In-Scope Open None
Traffic 99216 Please set up a CNAME for videoserver.wikimedia.org to Video Editing Server In-Scope Open None
Traffic 99531 [Task] move wikiba.se webhosting to wikimedia misc-cluster Screep Open None
Traffic 127482 Enable VCL source-DC switching via confd In-Scope Open None
Traffic 131930 Set SPF (... -all) for toolserver.org In-Scope Open None
Traffic 130904 Host rewrite for /static/ not applied to purges In-Scope Open None
Traffic 124418 Investigate massive increase in htmlCacheUpdate jobs in Dec/Jan In-Scope Open None
Traffic 129839 restrict upload cache access for private wikis In-Scope Open None
Traffic 124954 Decrease max object TTL in varnishes In-Scope Open None
Traffic 129682 Look into solutions for replaying traffic to testing environment(s) In-Scope Open None
Traffic 128559 store.wikimedia.org HTTPS issues In-Scope Open None
Traffic 128409 Detect tools.wmflabs.org tools which are HTTP-only In-Scope Open None
Traffic 125170 Internal DNS resolver responds with NXDOMAIN for localhost AAAA In-Scope Open None
Traffic 128374 Sort out analytics service dependency issues for cp* cache hosts In-Scope Open None
Traffic 128358 Uploading 1.2GB ogv results in 503 In-Scope Open None
Traffic 128188 Make CI run Varnish VCL tests In-Scope Open None
Traffic 128182 Server certificate is classified as invalid on government computers In-Scope Open None
Traffic 127573 wikiknihy.cz - transfer to Wikimedia Czech Republic? In-Scope Open None
Traffic 127485 Enable VCL applayer datacenter-switch via confd In-Scope Open None
DBA 165977 Create CoC committee private wiki In-Scope Cut None
DBA 167031 Global rename of Idh0854 → Garam: supervision needed In-Scope Done None
DBA 115982 Drop the tables old_growth, hitcounter, click_tracking, click_tracking_user_properties from enwiki, maybe other schemas In-Scope Done None
DBA 158893 dbstore1001 troubleshoot IPMI issue In-Scope Done None
DBA 160392 Reset db1070 idrac In-Scope Done None
DBA 162233 eqiad rack/setup 11 new DB servers In-Scope Done None
DBA 164073 Puppetize Piwik's Database and set up periodical backups In-Scope Done None
DBA 169396 Global rename of Markos90 → Mαρκος: supervision needed Screep Done None
DBA 169448 Degraded RAID on db1066 Screep Done None
DBA 169527 Global rename of Antero de Quintal → JMagalhães: supervision needed Screep Done None
DBA 169928 Evaluate how hard would be to get aa(wikibooks|wiktionary) and howiki databases deleted Screep Done None
DBA 170158 Setup database for dmarc service Screep Done None
DBA 170503 Degraded RAID on db2019 Screep Done None
DBA 170941 Global rename of user Moros Screep Done None
DBA 171474 Global rename of Carrotkit → 胡蘿蔔: supervision needed Screep Done None
DBA 171723 Degraded RAID on db1068 Screep Done None
DBA 172265 es2013 faulty BBU Screep Done None
DBA 109179 Migrate MySQLs to use ROW-based replication In-Scope Open None
DBA 162699 Decomissions old s2 eqiad hosts (db1018, db1021, db1024, db1036) In-Scope Open None
DBA 162789 Create less overhead on bacula jobs when dumping production databases In-Scope Open None
DBA 152427 Create a check/calendar alert for MariaDB TLS certs In-Scope Open None
DBA 168584 Labsdb* servers need to be rebooted In-Scope Open None
DBA 157359 labsdb1006/1007 (postgresql) maintenance In-Scope Open None
DBA 157702 Followup for TLS MariaDB server roll-out In-Scope Open None
DBA 107610 Setup separate logical External Store for Flow in production In-Scope Open None
DBA 163143 dbtree: don't return 200 on error pages In-Scope Open None
DBA 163303 Increase timeout for mariadb replication check In-Scope Open None
DBA 163339 pdu phase inbalances: ps1-a3-codfw, ps1-c6-codfw, & ps1-d6-codfw In-Scope Open None
DBA 148078 Decommission db1015, db1035, db1044 and db1038 In-Scope Open None
DBA 119154 Move echo tables from local wiki databases onto extension1 cluster for mediawikiwiki, metawiki, and officewiki In-Scope Open None
DBA 141547 Setup automatic failover for misc database servers In-Scope Open None
DBA 169501 Move some masters away from B6 Screep Open None
DBA 112473 Better mysql monitoring for number of connections and processlist strange patterns In-Scope Open None
DBA 163778 Decommission db1022 (Was: db1022 broke while changing topology on s6- evaluate if to fix or directly decommission) In-Scope Open None
DBA 164173 Cache invalidations coming from the JobQueue are causing lag on several wikis In-Scope Open None
DBA 112282 Multiple pages with no revisions In-Scope Open None
DBA 145072 Create a script to regenerate prometheus mysqld exporter listing that works with puppetdb In-Scope Open None
DBA 134476 Decommission old coredb machines (<=db1050) In-Scope Open None
DBA 153440 Create a full backup of all external storage records that would be easy to restore/setup a temporary delayed slave In-Scope Open None
DBA 135851 Preserve InnoDB table auto_increment on restart In-Scope Open None
DBA 141968 Display lag on grafana (prometheus) and dbtree from pt-heartbeat instead (or in addition) of Seconds_Behind_Master In-Scope Open None
DBA 104699 Firewall configurations for database hosts In-Scope Open None
DBA 164702 Decommission db1024 In-Scope Open None
DBA 148955 Puppetize tendril web user creation In-Scope Open None
DBA 145885 Gerrit shows HTTP 500 error when pasting extended unicode characters In-Scope Open None
DBA 164834 In some database hosts, performance schema loses digest statistics In-Scope Open None
DBA 134809 Apache <=> mariadb SSL/TLS for cross-datacenter writes In-Scope Open None
DBA 165625 Evaluate future of wmf puppet module "mysql" In-Scope Open None
DBA 171071 Perform testing for TLS effect on connection rate Screep Open None
DBA 165674 Investigate slow servermon updating queries on db1016 In-Scope Open None
DBA 165677 Create a backend check for pybal to monitor the MySQL protocol being up In-Scope Open None
DBA 149643 Review Icinga alarms with disabled notifications In-Scope Open None
DBA 119626 Eliminate SPOF at the main database infrastructure In-Scope Open None
DBA 155764 Gerrit: Schedule downtime to migrate db to utf8mb4 In-Scope Open None
DBA 166108 x1 master db1031: Faulty BBU In-Scope Open None
DBA 133523 [RFC] improve parsercache replication and sharding handling In-Scope Open None
DBA 143896 MySQL monitoring with prometheus In-Scope Open None
DBA 166486 Decommission db1023 In-Scope Open None
DBA 100501 mysql user and group should be a system user/group In-Scope Open None
DBA 151491 Icinga MariaDB disk space check on silver checks the wrong partition In-Scope Open None
DBA 161754 eqiad: (2) hardware access request for labsdb1004 & 5 refresh In-Scope Open None
DBA 161755 eqiad: (2) hardware access request for labsdb1006 & 7 refresh In-Scope Open None
DBA 141252 icinga hp raid check timeout on busy ms-be and db machines In-Scope Open None
DBA 141255 Separate host lookup from the sql shell script In-Scope Open None
DBA 54932 Drop *_old database tables from Wikimedia wikis In-Scope Open None
DBA 50930 Database replication problems - production and labs (tracking) In-Scope Open None
DBA 151999 Create script to monitor db dumps for backups are successful (and if not, old backups are not deleted) In-Scope Open None
DBA 156844 Prep to decommission old dbstore hosts (db1046, db1047) In-Scope Open None
DBA 172498 Switch databases to the future parser Screep Open None
DBA 126252 Populate the wikishared db on all dbstores In-Scope Open None
DBA 162070 Cleanup or remove mysql puppet module; repurpose mariadb module to cover misc use cases In-Scope Open None
DBA 127570 Rename be_x_oldwiki database to be_taraskwiki In-Scope Open None
Software Development 169619 Degraded RAID on ms-be2024 Screep Done None
Software Development 157002 Puppet compiler: re-add the concurrency option NUM_THREADS In-Scope Open None
Software Development 144264 wmf-reimage and handling of "-n" option In-Scope Open None
Software Development 157133 Consider adding a --skip-conftool option to puppet-merge In-Scope Open None
Software Development 152950 E901 SyntaxError: invalid syntax is wrongly raised on using python's abc by jenkins python CI linter In-Scope Open None
Software Development 154776 Puppet compiler: order resources for easy comparison between hosts In-Scope Open None
Software Development 166300 Remove Salt from wmf-auto-reimage / wmf-reimage In-Scope Open None
Software Development 144169 Flake8 for python files without extension in puppet repo In-Scope Open None
Software Development 148494 Add shell scripts CI validations In-Scope Open None
Software Development 164780 Sunset our use of Salt In-Scope Open None
Software Development 86556 monitor SSD wear levels In-Scope Open None
Software Development 164817 Migrate debdeploy to cumin In-Scope Open None
Software Development 167504 New tool to track package updates/status for hosts and images (debmonitor) In-Scope Open None
Software Development 143536 Upgrade all mw* servers to debian jessie In-Scope Open None
Software Development 150560 More verbose messages from service-checker-swagger In-Scope Open None
Software Development 164587 cumin could use randomization/splay options In-Scope Open None
Software Development 155705 confctl: log to SAL even if the selection doesn't match any host In-Scope Open None
Software Development 159045 Update Puppet repo code that uses maniphest.update and maniphest.createtask conduit api In-Scope Open None
Software Development 157001 Puppet compiler: abort on git rebase conflict In-Scope Open None
Hardware Requests 168472 reclaim/decom tmh200[12] In-Scope Done None
Hardware Requests 162785 Decommission ms-be2001 - ms-be2012 In-Scope Done None
Hardware Requests 169506 Decommission subra/suhail Screep Done None
Hardware Requests 164588 decom mira In-Scope Done None
Hardware Requests 166341 SSDs for main Kafka clusters In-Scope Open None
Hardware Requests 170441 Decommission mw1196 Screep Open None
Hardware Requests 130883 decom cp3011-22 (12 machines) In-Scope Open None
Hardware Requests 168271 Decommission mw1170-mw1179 In-Scope Open None
Hardware Requests 172487 decom iridium Screep Open None
Hardware Requests 159996 decom fluorine In-Scope Open None
Hardware Requests 168559 decom silver (was silver has trouble rebooting) In-Scope Open None
Hardware Requests 170157 decommission rcs100[12] Screep Open 3.0
Hardware Requests 171018 decom netmon1001 Screep Open None
Hardware Requests 159480 Decommission bast3001 In-Scope Open None
Hardware Requests 95742 Decomission amssq31-62 (32 hosts) In-Scope Open None
Hardware Requests 166489 Decommission ms-be1001 - ms-be1012 In-Scope Open None
Hardware Requests 169020 Decommission cp400[1-4] In-Scope Open None
Hardware Requests 172323 Decommission WMF3248 (old R510) Screep Open None
Hardware Requests 160986 Decommission ms-fe100[1-4] In-Scope Open None
Hardware Requests 167377 Decommission cp4011, cp4012, cp4019, cp4020 In-Scope Open None
Hardware Requests 167376 Decommission cp300[3456] In-Scope Open None
Hardware Requests 171179 Decommisson restbase-dev100[1-3] Screep Open None
Other Operations 123147 Wikipedia.com warns about bad certificate Unknown None
Other Operations 132921 Unable to delete file pages on commons: MWException/LocalFileLockError: "Could not acquire lock" In-Scope Cut None
Other Operations 71336 VIPS scaled thumbnails don't have a comment with a link to the file description page In-Scope Cut None
Other Operations 10217 Wikipedias with zh-* language codes waiting to be renamed (zh-min-nan -> nan, zh-yue -> yue, zh-classical -> lzh) In-Scope Cut None
Other Operations 158029 luasandbox profiler doesn't sort results due to an HHVM bug In-Scope Cut None
Other Operations 170307 mw2201, mw2202 - contact Dell and replace main board Screep Done None
Other Operations 172285 wikitech-static.wikimedia.org certificate renewal (expiring 2017-08-09) Screep Done None
Other Operations 170361 Remove mbrar@wikimedia.org from legal-tm-vio@wikimedia.org Screep Done None
Other Operations 87840 Retire Torrus Screep Done None
Other Operations 172330 broken dependencies for python-snimpy on jessie Screep Done None
Other Operations 146113 wtp2019 - hardware (RAM) check In-Scope Done None
Other Operations 159756 setup netmon1002.wikimedia.org In-Scope Done None
Other Operations 165171 rack/setup/install ores1001-1009 In-Scope Done None
Other Operations 170496 Firewalls appear to be preventing spark executors from talking to spark driver on stat1005 Screep Done 3.0
Other Operations 134890 check status of multiple systemd units In-Scope Done None
Other Operations 129222 Icinga disk space check should also check inode usage In-Scope Done None
Other Operations 154658 Prepare and improve the datacenter switchover procedure In-Scope Done None
Other Operations 170551 Remove "lucie" from the wmde LDAP group Screep Done None
Other Operations 170552 Add "chrisneuroth" to wmde LDAP group Screep Done None
Other Operations 86971 Decide on /var/lib vs /home as locations of homedir for mwdeploy In-Scope Done None
Other Operations 170592 Requesting access to recommendation-api for nschaaf Screep Done None
Other Operations 170613 determine model/serial info for kvm-ulsfo Screep Done None
Other Operations 165366 rack/setup/install replacement stat1006 (stat1003 replacement) In-Scope Done None
Other Operations 165368 rack/setup/install replacement to stat1005 (stat1002 replacement) In-Scope Done None
Other Operations 170653 create netmon1003, migrate servermon from netmon1001 to netmon1003 Screep Done None
Other Operations 170655 VM request: netmon1003 Screep Done None
Other Operations 170679 LDAP access to the wmf group for Anne Gomez Screep Done None
Other Operations 160097 audit spare disk levels for codfw & eqiad utlized storage in servers In-Scope Done None
Other Operations 170803 Reset admin password for Huggle mailing list Screep Done None
Other Operations 149298 Reimage/rename codfw pool counters In-Scope Done None
Other Operations 125205 Monitor hardware thermal issues In-Scope Done None
Other Operations 165531 rack/setup/install labvirt101[5-8] In-Scope Done None
Other Operations 170854 Update mediawiki on wikitech-static Screep Done None
Other Operations 170886 ocg1001 is broken Screep Done None
Other Operations 170891 Make SPF for wikipedia.org more strict Screep Done None
Other Operations 170930 Add Dinka Wikipedia to Wikidata Screep Done None
Other Operations 170932 prometheus-puppet-agent-stats cronspam on missing puppet stats Screep Done None
Other Operations 104258 Create instrumentation to monitor load on geoiplookup.wikimedia.org In-Scope Done None
Other Operations 149432 puppet compiler claims "no change" when catalogs are actually different In-Scope Done None
Other Operations 172409 Copper root (/) 95% full Screep Done None
Other Operations 133110 Check for an oversized exim4 queue indicating mail delivery failures In-Scope Done None
Other Operations 167279 Create "network" icinga group Screep Done None
Other Operations 120683 logrotate/disk space on silver for nutcracker log In-Scope Done None
Other Operations 149557 Site: 2 VM request for tendril (switch tendril from einsteinium to dbmonitor*) In-Scope Done None
Other Operations 151075 setup/install restbase-dev100[123] In-Scope Done None
Other Operations 143405 Move labs 'instances' data to graphite labs In-Scope Done None
Other Operations 171174 a lot of beta cluster instances are not reachable over SSH Screep Done None
Other Operations 171177 New instance in deployment prep can't run puppet for the first time Screep Done None
Other Operations 171183 Degraded RAID on ms-be1016 Screep Done None
Other Operations 156982 Cleanup tools nfs share on labstore1004/5 In-Scope Done None
Other Operations 171232 Degraded RAID on db1001 Screep Done None
Other Operations 171275 ms-be2024 not powering on Screep Done None
Other Operations 171280 wikitech api list=novainstances not returning list of instances Screep Done None
Other Operations 78342 Create a basic RSpec unit test for operations/puppet In-Scope Done None
Other Operations 149812 Set up docker building environment for production In-Scope Done None
Other Operations 167664 New Service Request: recommendation-api In-Scope Done None
Other Operations 138314 mobileapps 500s following reboot of restbase1007 In-Scope Done None
Other Operations 166180 rack/setup/install netmon2001 In-Scope Done None
Other Operations 166181 rack/setup/install restbase-dev100[456] In-Scope Done None
Other Operations 133979 puppet compiler error on catalog with non-ascii output In-Scope Done None
Other Operations 171538 Degraded RAID on labsdb1001 Screep Done None
Other Operations 171580 Diamond log level set to DEBUG spams syslog Screep Done None
Other Operations 171583 Diamond collectors collects NFS statistics on Cloud-VPS Screep Done None
Other Operations 166240 Determine if benefactorevents.wikimedia.org should be hosted on the production cluster or still on Microsoft Azure In-Scope Done None
Other Operations 127602 Reboot during puppet run causes /var/lib/puppet/state/agent_catalog_run.lock to be left and puppet to not start running again In-Scope Done None
Other Operations 172559 Ensure getLagTimes.php is working properly Screep Done None
Other Operations 167714 Create Atikamekw Wikipedia In-Scope Done None
Other Operations 166587 Ops Onboarding for Keith Herron In-Scope Done None
Other Operations 126221 Evaluate efficacy of DateTieredCompactionStrategy In-Scope Done None
Other Operations 127488 (re)move problemsdonating aliases In-Scope Done None
Other Operations 113733 column family cassandra metrics size In-Scope Done None
Other Operations 166888 CI for operations/puppet is taking too long In-Scope Done None
Other Operations 119541 Self hosted puppetmaster is broken In-Scope Done None
Other Operations 171880 Add AAAA records for labpuppetmaster1001 and 1002 Screep Done None
Other Operations 167763 Troubleshoot scb2005 NICs In-Scope Done None
Other Operations 95679 Make a puppet role that sets up a query service and loads it Screep Done None
Other Operations 152129 reinstall iridium (phabricator) as phab1001 with jessie In-Scope Done None
Other Operations 167905 rack/setup/install labpuppetmaster100[12].wikimedia.org In-Scope Done None
Other Operations 157089 Add storage to Change-Prop for deduplication In-Scope Done None
Other Operations 152340 labservices1001 down, suspected overheating In-Scope Done None
Other Operations 171903 mw1209 /usr/bin/timeout: the monitored command dumped core Screep Done None
Other Operations 162735 Hyperthreading disabled on restbase2002.codfw.wmnet & restbase1015.codfw.wmnet In-Scope Done None
Other Operations 162765 Set up grafana alerting for services In-Scope Done None
Other Operations 162770 SATA errors for stat1004 in the dmesg In-Scope Done None
Other Operations 162780 ocg1003 partitions are severely misconfigured In-Scope Done None
Other Operations 168442 Grant AWight accounts on ores production clusters In-Scope Done None
Other Operations 168444 Logo for sr.wikiquote.org Screep Done None
Other Operations 171917 setup releases2001.codfw.wmnet Screep Done None
Other Operations 162792 Reduce Swift technical debt In-Scope Done None
Other Operations 162796 Delete non-used and/or non-requested thumbnail sizes periodically In-Scope Done None
Other Operations 168518 Create Dinka Wikipedia In-Scope Done None
Other Operations 107819 Need sudo to blazegraph on wdqs1001/1002 Screep Done None
Other Operations 168534 scb1003 unresponsive after reboot In-Scope Done None
Other Operations 161717 Point swiftrepl to swift HTTPS In-Scope Done None
Other Operations 171924 notebook100[12] - Invalid relationship: Apt::Pin[r-base] Screep Done None
Other Operations 171926 Degraded RAID on ms-be1017 Screep Done None
Other Operations 162949 hosts with puppet compiler failures on every run In-Scope Done None
Other Operations 168683 Upgrade pandoc package to at least 1.12.3 In-Scope Done None
Other Operations 112648 enable restbase syslog/file logging In-Scope Done None
Other Operations 168705 nutcracker test config in puppet doesn't work In-Scope Done None
Other Operations 168764 Reopen Wikinews Dutch In-Scope Done None
Other Operations 157496 A few hosts never get clean puppet compiler runs In-Scope Done None
Other Operations 171928 Wikidata and dewiki databases locked Screep Done None
Other Operations 168782 Create fishbowl wiki for Maithili Wikimedians User Group Screep Done None
Other Operations 1075 Audit groups of metrics in Graphite that allocate a lot of disk space In-Scope Done None
Other Operations 168881 Rename mw2148 / mw2149 / mw2259 / mw2260 to thumbor200[1234] In-Scope Done None
Other Operations 171959 instance root passwords vs. multiple puppetmasters Screep Done None
Other Operations 168892 rack/setup/install labtestservices2002.wikimedia.org In-Scope Done None
Other Operations 168893 rack/setup/install labtestservices2003.wikimedia.org In-Scope Done None
Other Operations 168894 rack/setup/install labtestcontrol2003.wikimedia.org In-Scope Done None
Other Operations 125629 Depool proxies temporarily while scap is ongoing to avoid taxing those nodes In-Scope Done None
Other Operations 168927 Smartctl errors for one kafka1012 disk In-Scope Done 3.0
Other Operations 168962 wikitech-static sync check shouldn't happen so often In-Scope Done None
Other Operations 157807 Reinstall Analytics Hadoop Cluster with Debian Jessie In-Scope Done 0.0
Other Operations 169023 Remove left-over alias for wikidata.org/ontology (doesn't work) In-Scope Done None
Other Operations 157853 Replace nrpe 2.15 (& evaluate alternatives) In-Scope Done None
Other Operations 169114 Beta thumbnails are broken Screep Done None
Other Operations 167333 Increase email log retention period for the main email relays In-Scope Done None
Other Operations 119915 Create response time monitoring for WDQS endpoint In-Scope Done None
Other Operations 150434 Build calico In-Scope Done None
Other Operations 150456 puppet compiler fails with modules using puppetdb In-Scope Done None
Other Operations 169299 Mobileapps swagger spec is broken (no pronounciation for `page/mobile-sections-lead` endpoints) In-Scope Done None
Other Operations 169312 Implement poolcounter failover in Thumbor In-Scope Done None
Other Operations 169313 Investigate poolcounter failure leading to thumbor failing to generate thumbs In-Scope Done None
Other Operations 169321 Monitor all management interfaces In-Scope Done None
Other Operations 137181 Update restbase catchpoint metric In-Scope Done None
Other Operations 172008 librenms - syslog stopped working after migration Screep Done None
Other Operations 169360 Unresponsive/misconfigured iDRACs over the host-BMC interface In-Scope Done None
Other Operations 169485 Add support for directory environments to our puppet classes, production puppetmaster Screep Done None
Other Operations 129645 Remove sca100x from the list of Mathoid's minions In-Scope Done None
Other Operations 172111 Fix fqdn for promethium Screep Done None
Other Operations 169546 Add results of compilation with the future parser to the puppet compiler Screep Done None
Other Operations 172115 ganeti2003 ipmi_sdr_cache_create: internal IPMI error Screep Done None
Other Operations 169566 Site: (2) VM request for DMARC Screep Done None
Other Operations 133191 Make SPF for wikimedia.org more strict In-Scope Done None
Other Operations 163938 setup/install phab1001.eqiad.wmnet In-Scope Done None
Other Operations 169601 Complete stretch reimage for ms-fe / ms-be fleet Screep Done None
Other Operations 169605 Default to ext4 instead of ext3 Screep Done None
Other Operations 169612 tlsproxy fail on ms-fe2005 with stretch Screep Done None
Other Operations 158913 Move labstore1002 and labstore1002-array1 and labstore1002-array2 to different rack (currently in C3) In-Scope Done None
Other Operations 163998 check_hpssacli should report on battery failures and cache disabled In-Scope Done None
Other Operations 164030 setup releases1001.eqiad.wmnet (was: setup mwreleases1001) In-Scope Done None
Other Operations 133696 NIH db misbehaviour causing problems to Citoid In-Scope Done 1.0
Other Operations 172689 Install missing Spamassassin DKIM dependencies on lists and mx Screep Done None
Other Operations 169696 restbase-dev1003 stuck after reboot Screep Done None
Other Operations 169698 jupyterhub.service in failed state on notebook1001 due to removed user Screep Done None
Other Operations 169773 Upgrade grafana to 4.4.1 Screep Done None
Other Operations 169822 Run updateArticleCount.php on Wikimedia Commons Screep Done None
Other Operations 169871 mgmt inaccessible on restbase1018 Screep Done None
Other Operations 164206 Icinga loses downtime entries, causing alert and page spam In-Scope Done None
Other Operations 164209 webpagetest-alerts: Difference in size authenticated In-Scope Done None
Other Operations 150160 Remote IPMI doesn't work for ~2% of the fleet In-Scope Done None
Other Operations 153099 Initial OpenStack Neutron PoC deployment in Labtest In-Scope Done None
Other Operations 151648 Implement storage policies for swift In-Scope Done None
Other Operations 169953 postgresql::ganglia on puppetdb servers - authentication failed Screep Done None
Other Operations 169959 bast3002 didn't come up after reboot Screep Done None
Other Operations 167138 labvirt1008/labsdb1001: FreeIPMI returned an empty header map In-Scope Done None
Other Operations 167157 rack/setup/install labtestpuppetmaster2001 In-Scope Done None
Other Operations 105185 Need deploy rights for Wikidata Query Service Screep Done None
Other Operations 151748 Cron conflict for kafkatee logrotate on oxygen In-Scope Done None
Other Operations 170139 remove icinga monitoring for benefactorevents.wm.o SSL certificate Screep Done None
Other Operations 104879 Define the details of the hardware we need to run WDQS Screep Done None
Other Operations 172218 Unable to locate package libgumbo-dev Screep Done None
Other Operations 172254 Double quotes in nutcracker config make json stats invalid json Screep Done None
Other Operations 156495 Upgrade nginx on notebook* servers In-Scope Done None
Other Operations 116747 Meta task "Revamp user authentication" In-Scope Open None
Other Operations 152100 should we make privatewiki list available to puppet without maintaining two lists? In-Scope Open None
Other Operations 152439 cronspam from labtestservices2001 /etc/dns-floating-ip-updater.py > /dev/null In-Scope Open None
Other Operations 152445 Move prometheus entry point off port 80 In-Scope Open None
Other Operations 152562 Port fundraising stats off Ganglia In-Scope Open None
Other Operations 116742 Track amount of package updates on systems In-Scope Open None
Other Operations 152632 Explore hosting the multimedia commons use case In-Scope Open None
Other Operations 152724 Current state and next steps for RESTBase storage In-Scope Open None
Other Operations 152767 Missing Labs hiera entry in labs-private repo In-Scope Open None
Other Operations 152782 Kibana functionality missing after upgrade: histograms In-Scope Open None
Other Operations 152791 Improvements to Ganglia-equivalent Prometheus dashboards In-Scope Open None
Other Operations 116627 Include 5xx numbers in fluorine fatalmonitor In-Scope Open None
Other Operations 153068 Consider mounting labs NFS labstore1003.eqiad.wmnet:/scratch for server-side uploads In-Scope Open None
Other Operations 153083 Investigate I/O limits on elasticsearch servers In-Scope Open None
Other Operations 153246 Puppet failures with "Attempt to assign to a reserved variable name: 'trusted'" In-Scope Open None
Other Operations 153279 labnet/ labtestnet2001 - disk space - nova-api.log needs rotation In-Scope Open None
Other Operations 153416 docker-engine pulled into our repositories only keeps the latest version In-Scope Open None
Other Operations 116580 monitor postgresql replication status In-Scope Open None
Other Operations 116288 Install mailman-api for internal use In-Scope Open None
Other Operations 153703 Option: Consider switching back to leveled compaction (LCS) In-Scope Open None
Other Operations 153816 apache::static_site is not working In-Scope Open None
Other Operations 153940 Logrotate fails for: "$FILE No such file or directory" In-Scope Open None
Other Operations 116063 Hardware Automation Workflow - Overall Tracking In-Scope Open None
Other Operations 154026 On mobile, http://wikipedia.org/wiki/Foo redirects to https://www.m.wikipedia.org/wiki/Foo which does not exist Screep Open None
Other Operations 115899 Move scap target configuration to etcd In-Scope Open None
Other Operations 154619 Export ipsec counters as Prometheus metrics Screep Open None
Other Operations 154627 Production error message (when servers are down) points users to donate link which is likely to produce the same error message In-Scope Open None
Other Operations 154665 Look into behaviour of /etc/exim4/update-exim4.conf.conf related to updates In-Scope Open None
Other Operations 115757 document debian packaging guidelines In-Scope Open None
Other Operations 115194 Some labs instances IP have multiple PTR entries in DNS In-Scope Open None
Other Operations 114849 Log lines on flourine overflow at 8092 bytes. In-Scope Open None
Other Operations 154915 Get rid of "import realm.pp" in manifests/site.pp In-Scope Open None
Other Operations 114801 operations-apache-config-lint replacement doesn't check syntax In-Scope Open None
Other Operations 155129 Create prometheus nutcracker exporter In-Scope Open None
Other Operations 155209 Increase $wgHTTPImportTimeout to a higher value on WMF wikis In-Scope Open None
Other Operations 114446 move human users out of UID range for system accounts In-Scope Open None
Other Operations 155401 Integrate jessie 8.7 point release In-Scope Open None
Other Operations 155761 DNS repo: add Jenkins job to ensure there are no duplicates In-Scope Open None
Other Operations 114337 Assign 3 more servers to video scaler duty In-Scope Open None
Other Operations 155869 Fix permissions for systemd file In-Scope Open None
Other Operations 155929 Create /community-beacon alternative entry point In-Scope Open None
Other Operations 113792 Change LDAP cn to something more useful (was Rename "Dzahn" to "Daniel Zahn" in Gerrit) In-Scope Open None
Other Operations 113785 Make the Shinken IRC alert and icinga-wm bots use colors In-Scope Open None
Other Operations 156136 Increase swift replication factor for accounts In-Scope Open None
Other Operations 156140 Lots of hosts with hyperthreading disabled In-Scope Open None
Other Operations 156143 High CPU usage from swift-proxy on frontend machines In-Scope Open None
Other Operations 156232 confctl SubjectAltNameWarning after python-urllib3 upgrade In-Scope Open None
Other Operations 113104 Set up a service IP for logstash In-Scope Open None
Other Operations 156398 Decommission or repair old asw-c2-eqiad In-Scope Open None
Other Operations 156475 Investigate spike in 500s during asw-c2-eqiad replacement In-Scope Open None
Other Operations 156544 Create backups of Wikimedia content in diverse geographic places In-Scope Open None
Other Operations 156570 Investigate issues with wikitech-static.wikimedia.org In-Scope Open None
Other Operations 156924 Allow integration of data from etcd into the MediaWiki configuration In-Scope Open None
Other Operations 156937 Provide cross-dc redundancy (active-active or active-passive) to all important misc services In-Scope Open None
Other Operations 156955 Standardizing our partman recipes In-Scope Open None
Other Operations 157038 Make it possible to run the mediawiki testsuite against a staging repo of apt.wikimedia.org In-Scope Open None
Other Operations 157306 Fix config file handling for /etc/hhvm/php.ini In-Scope Open None
Other Operations 112774 solve mtp panel issue for row uplinks In-Scope Open None
Other Operations 157761 use htpasswd instead of htdigest for arbcom archive passwords In-Scope Open None
Other Operations 157972 Puppet fails only once when restarting ferm is not successful In-Scope Open None
Other Operations 158022 make apt.wikimedia.org HA In-Scope Open None
Other Operations 158196 Reimage labstore1001 and labstore1002 for DRBD storage setup In-Scope Open None
Other Operations 158288 Unclean stop of jobrunner service via puppet In-Scope Open None
Other Operations 158429 Switch to predictable network interface names? In-Scope Open None
Other Operations 158434 Phabricator: Make sure phabricator works properly including our puppet roles on jessie In-Scope Open None
Other Operations 158562 Manage apt sources via puppet? In-Scope Open None
Other Operations 158583 Restructure our internal repositories further In-Scope Open None
Other Operations 158757 Puppet certificate missing subjectAltName In-Scope Open None
Other Operations 158837 Consolidate performance website and related software In-Scope Open None
Other Operations 172628 conf2002 etcdmirror-conftool-eqiad-wmnet died Screep Open None
Other Operations 158915 Make sure replying to emails in gerrit 2.14 works In-Scope Open None
Other Operations 112257 rename cassandra cluster In-Scope Open None
Other Operations 159242 Segmentation fault creating thumbnail In-Scope Open None
Other Operations 111934 Nutcracker stats monitoring should only listen on localhost In-Scope Open None
Other Operations 159354 Move coal from graphite machine(s) In-Scope Open None
Other Operations 111838 Some files had disappeared from Commons after renaming In-Scope Open None
Other Operations 111595 Do not apply spam headers on email assessed NOT to be spam In-Scope Open None
Other Operations 111540 Clean up labs graphite datapoints In-Scope Open None
Other Operations 159524 backup space is used unwisely In-Scope Open None
Other Operations 159536 Puppet constantly trying to stop the already stopped puppetmaster process on Trusty In-Scope Open None
Other Operations 159661 Improve Terbium (and wasat) userland to process server side uploads In-Scope Open None
Other Operations 159687 etcd switchover/enhancements In-Scope Open None
Other Operations 159750 E-mail for people in different OIT LDAP object unit In-Scope Open None
Other Operations 159830 Sanity check global-multiwrite logs for ConfirmEdit usage In-Scope Open None
Other Operations 159922 pdfrender fails to serve requests since Mar 8 00:30:32 UTC on scb1003 In-Scope Open None
Other Operations 160060 Icinga check for sysctl settings In-Scope Open None
Other Operations 160071 Add slabinfo prometheus exporter In-Scope Open None
Other Operations 160101 Upgrade php5-json .deb to at least 1.3.8 In-Scope Open None
Other Operations 160146 jobrunner/jobchron services fail in codfw In-Scope Open None
Other Operations 160158 Make disabled accounts visible in the corp mirror LDAP replica In-Scope Open None
Other Operations 160229 Back up of Commons files In-Scope Open None
Other Operations 172633 Analytics1034 eth0 negotiated speed to 100Mb/s instead of 1000Mb/s Screep Open 1.0
Other Operations 160412 Add lock_wait_timeout to maintain_views and maintain-meta_p In-Scope Open None
Other Operations 160529 Sender email spoofing In-Scope Open None
Other Operations 110240 [Discussion] Consider validating JSON schemas when running x-ample tests? In-Scope Open None
Other Operations 160677 Effects on adjusting Prometheus retention In-Scope Open None
Other Operations 160941 Improve SSH access information in onboarding documentation In-Scope Open None
Other Operations 161003 Cross-check disabled accounts from corp LDAP against data.yaml In-Scope Open None
Other Operations 161004 Remove disabled users from internal mailing lists In-Scope Open None
Other Operations 161096 confctl no longer logs a non-changing state change In-Scope Open None
Other Operations 110171 Alert when ES indexes are freezed for more than 30 minutes In-Scope Open None
Other Operations 161145 Fix the general problem of randomly-bad puppet agent cron timings within redundant clusters In-Scope Open None
Other Operations 110169 Monitor redis memory/disk usage In-Scope Open None
Other Operations 161296 Upgrade mysqld_exporter to 0.10.0 In-Scope Open None
Other Operations 109606 Re-evaluate Limesurvey In-Scope Open None
Other Operations 161528 incident 20170323-wikibase did not trigger Icinga paging In-Scope Open None
Other Operations 161566 add support to offboard-user to support mailman list removal In-Scope Open None
Other Operations 161598 Monitor HHVM bytecode cache depletion on mediawiki app servers In-Scope Open None
Other Operations 161834 Undo special tools-home and tools-project share definitions for NFS In-Scope Open None
Other Operations 161835 Convert labstore cluster configuration to hiera and profiles In-Scope Open None
Other Operations 161864 404 error while accessing some images files e.g. djvu and jpg In-Scope Open None
Other Operations 161899 Investigate ceasing self-service new Trusty instance creation in Labs In-Scope Open None
Other Operations 161904 decommission backup4001 In-Scope Open None
Other Operations 161918 videoscalers (mw1168, mw1169) - high load / overheating In-Scope Open None
Other Operations 161920 logrotate for ruthenium In-Scope Open None
Other Operations 162013 etcd cluster in codfw has raft consensus issues In-Scope Open None
Other Operations 162029 Migrate all jessie hosts to Linux 4.9 In-Scope Open None
Other Operations 162037 Use SSL certificates with discovery entry for elasticsearch In-Scope Open None
Other Operations 162039 Prepare to service applications from kubernetes In-Scope Open None
Other Operations 162043 Define a process to keep images up-to-date on similar standards as the rest of production In-Scope Open None
Other Operations 162090 Investigate alternative RAID strategies for labstore1001/2 In-Scope Open None
Other Operations 109090 Investigate the need for master only (non data nodes) in our ES cluster In-Scope Open None
Other Operations 162122 Swiftrepl was stuck in an infinite loop since days In-Scope Open None
Other Operations 162123 Running swiftrepl is not puppetized In-Scope Open None
Other Operations 172681 Analytics Kafka cluster causing timeouts to Varnishkafka since July 28th Screep Open None
Other Operations 162245 Enable GC for HHVM CLI (at least for dump runners) In-Scope Open None
Other Operations 109089 EPIC: Cultivating the Elasticsearch garden (operational lessons from 1.7.1 upgrade) In-Scope Open None
Other Operations 108985 Monitor MediaWiki sessions In-Scope Open None
Other Operations 162850 CPU throttling on DELL PowerEdge R320 In-Scope Open None
Other Operations 162857 Some Core availability Catchpoint tests might be more expensive than they need to be In-Scope Open None
Other Operations 162955 rebuild tools-grid-master as a large instance In-Scope Open None
Other Operations 163033 Create grafana dashboard for video scaler job runners In-Scope Open None
Other Operations 163068 More missing 'original' files on Commons In-Scope Open None
Other Operations 107267 Backport python-shade from debian/testing to jessie-wikimedia In-Scope Open None
Other Operations 107108 Flow notification links on mobile point to desktop In-Scope Open None
Other Operations 163288 Decide on /var/lib vs /home as locations of homedir for l10nupdate In-Scope Open None
Other Operations 163336 kube-proxy pulls in docker and starts service even when it isnt needed In-Scope Open None
Other Operations 163346 mw2256 - hardware issue In-Scope Open None
Other Operations 163354 Find a way to verify mediawiki-config IPs ahead of datacenter switchovers In-Scope Open None
Other Operations 163362 audit all codfw pdu tower draws In-Scope Open None
Other Operations 163393 Determine appropriate proxy_read_timeout setting for Tools Proxy In-Scope Open None
Other Operations 163402 Ensure we can survive a loss of labservices1001 In-Scope Open None
Other Operations 163507 Intermittent DB connectivity problem on phabricator, needs investigation In-Scope Open None
Other Operations 106937 Monitor [[Special:ListFiles]] for non 200 HTTP statuses in thumbnails In-Scope Open None
Other Operations 163667 Fix UIDs for deployment server users In-Scope Open None
Other Operations 163673 Some swift disks wrongly mounted on 5 ms-be hosts In-Scope Open None
Other Operations 106664 Set up role accounts and feedback loops (FBL) with all providers In-Scope Open None
Other Operations 163698 Add flood protection to the ircecho bot (icinga-wm) In-Scope Open None
Other Operations 163823 During labservices1001 failover fqdn changed from foo.project.eqiad.wmflabs to foo.eqiad.wmflabs In-Scope Open None
Other Operations 163996 Icinga check for ipv6 host reachability In-Scope Open None
Other Operations 164042 Racktables: clearly show when hosts are decommissioned In-Scope Open None
Other Operations 164123 tools-k8s-master-01 has two floating IPs In-Scope Open None
Other Operations 164238 move icinga contacts file to public repo In-Scope Open None
Other Operations 164248 HTTP responses from app servers sometimes stall for >1s In-Scope Open None
Other Operations 106346 setup an alertable threshold for Cassandra heap dumps In-Scope Open None
Other Operations 164290 Set up external DNS record for wikitech-static In-Scope Open None
Other Operations 105780 Create a doc explaining the SLA between services and the monitoring tool In-Scope Open None
Other Operations 164341 Decommission old memcached hosts - mc1001->mc1018 In-Scope Open None
Other Operations 164460 Use DNS discovery record for deployment CNAME In-Scope Open None
Other Operations 164490 maintain-meta_p hangs on connecting to wikimedia.org.uk In-Scope Open None
Other Operations 164703 Integrate jessie 8.8 point release In-Scope Open None
Other Operations 104671 Rename 'restricted' group? Screep Open None
Other Operations 164819 reprepro: Support for buildinfo files / dbgsym packages In-Scope Open None
Other Operations 104352 Make scap able to depool/repool servers via the conftool API In-Scope Open None
Other Operations 164993 archiva artifact links point to 127.0.0.1 In-Scope Open None
Other Operations 165105 Wiley requests for DOI and some other publishers don't work in production In-Scope Open None
Other Operations 165136 Ferm rules for labstore NFS hosts In-Scope Open None
Other Operations 165170 rack/setup/install ores2001-2009 In-Scope Open None
Other Operations 165173 rack/setup/install dumpsdata100[12] In-Scope Open None
Other Operations 165323 Add Prometheus machine metric to track core dumps In-Scope Open None
Other Operations 165345 decommission indium In-Scope Open None
Other Operations 165348 Check long-running screen/tmux sessions In-Scope Open None
Other Operations 165511 Change automatic shortlink in blog theme In-Scope Open None
Other Operations 165519 rack and setup mw1307-1348 In-Scope Open None
Other Operations 165520 rack and setup wtp1025-1048 In-Scope Open None
Other Operations 103886 Translation cache exhaustion caused by changes to PHP code in file scope Screep Open None
Other Operations 165618 Audit / document reasons for not enabling HT? In-Scope Open None
Other Operations 165631 move gerrit.wm.org SSH service to private/behind LVS like phab-vcs In-Scope Open None
Other Operations 102575 document graphite failover/backfill procedures In-Scope Open None
Other Operations 101585 document redis upgrade/restart procedures In-Scope Open None
Other Operations 165779 rack/setup/install labnet100[34] In-Scope Open None
Other Operations 165781 rack/setup/install labcontrol100[34] In-Scope Open None
Other Operations 165784 rack/setup/install labmon1002 In-Scope Open None
Other Operations 165885 Create a cron to clean clientbucket every day or hour In-Scope Open None
Other Operations 101141 udp rcvbuferrors and inerrors on graphite1001 In-Scope Open None
Other Operations 166038 Sync internal nutcracker package with Debian package In-Scope Open None
Other Operations 166066 Integrate the puppet compiler in the puppet CI pipeline In-Scope Open None
Other Operations 166081 rack/setup/install conf1004-conf1006 In-Scope Open None
Other Operations 100777 expose hosts in maintenance state so we can prevent scap from running on them In-Scope Open None
Other Operations 166233 Update redis puppet class to support stretch In-Scope Open None
Other Operations 166291 Exim panics when spamd reaches maxchildren In-Scope Open None
Other Operations 166322 spam from phabricator in labs In-Scope Open None
Other Operations 166368 Wipe of spare/replacement disks In-Scope Open None
Other Operations 98984 Check power supply balance settings on cp3030+ In-Scope Open None
Other Operations 98831 Honor DNT header for access logs & varnish logs In-Scope Open None
Other Operations 166937 Broken /a/refinery-source/guard/run_all_guards.sh script on stat1002 In-Scope Open None
Other Operations 97909 Upgrade jobrunners to redis 2.8 In-Scope Open None
Other Operations 167035 stretch acct monthly cron will spam when /var/log/wtmp.1 doesn't exist In-Scope Open None
Other Operations 97635 Update diamond to latest upstream version Screep Open None
Other Operations 167091 Elasticsearch errors about BulkShardRequest In-Scope Open None
Other Operations 167104 Figure out how to disable starting of jobrunner/jobchron in the non-active DC In-Scope Open None
Other Operations 167121 Several hosts return "internal IPMI error" in the check_ipmi_temp check In-Scope Open None
Other Operations 167130 Decom mw1170-mw1179, and replace them with new systems. In-Scope Open None
Other Operations 167225 Upload hhvm to stretch apt repo in apt.wikimedia.org In-Scope Open None
Other Operations 167239 Redirect status.wikipedia.org to status.wikimedia.org In-Scope Open None
Other Operations 167245 prometheus-node-exporter - invalid group: ‘prometheus:prometheus' In-Scope Open None
Other Operations 167269 Make security updates of docker images manageable In-Scope Open None
Other Operations 167292 Collate jessie-wikimedia/backports into jessie-wikimedia/main In-Scope Open None
Other Operations 97524 ocg alarm ocg_job_status_queue 'flapping' In-Scope Open None
Other Operations 97204 RFC: Request timeouts and retries In-Scope Open None
Other Operations 95801 Allow customizing the alert message from graphite In-Scope Open None
Other Operations 167412 host-vmem.erb is doing operations that make no sense In-Scope Open None
Other Operations 167422 Monitoring: add link to graph for Icinga timeseries alarms In-Scope Open None
Other Operations 167549 Create Icinga alert when OSM replication lags on maps In-Scope Open None
Other Operations 167689 Add RIPE atlas data to Prometheus Screep Open None
Other Operations 167820 rack/setup/install labweb100[12].wikimedia.org In-Scope Open None
Other Operations 167845 Migrate zuul-server behind systemd service In-Scope Open None
Other Operations 95054 Move ircecho config file to be YAML In-Scope Open None
Other Operations 167966 Look into feasibility of disabling sha-1 host keys on our ssh daemons In-Scope Open None
Other Operations 167984 rack/setup/install labstore100[67].wikimedia.org In-Scope Open None
Other Operations 167992 rack/setup/install new kafka nodes kafka-jumbo100[1-6] In-Scope Open None
Other Operations 95053 ircecho should accept input via unix sockets In-Scope Open None
Other Operations 168044 jobrunner / jobchron systemd services are in error state after a stop In-Scope Open None
Other Operations 168110 Puppet CA: virt1000.wikimedia.org' will expire on 2017-08-15 In-Scope Open None
Other Operations 168403 Aggregate prometheus functions yielding different results in grafana vs. prometheus console In-Scope Open None
Other Operations 168407 rack/setup/install labnodepool1002.eqiad.wmnet In-Scope Open None
Other Operations 168445 Reboots of cloud servers In-Scope Open None
Other Operations 168460 Update certificates on productions replicas of corp.wikimedia.org LDAP In-Scope Open None
Other Operations 168490 upgrade planet instances to stretch In-Scope Open None
Other Operations 95052 Make ircecho much better In-Scope Open None
Other Operations 168562 Reimage gerrit2001 as stretch In-Scope Open None
Other Operations 168613 Broken disk on mw1228 In-Scope Open None
Other Operations 168619 Degraded RAID on lvs3001 In-Scope Open None
Other Operations 94951 Enable the usage of `hhvm -m debug --debug-host ::1` from mw1017 so developers can step through code (think gdb) in production to see what is going wrong. In-Scope Open None
Other Operations 168765 Create Wikiversity Hindi In-Scope Open None
Other Operations 168767 Monitor PostgreSQL connection slots In-Scope Open None
Other Operations 168816 some elasticsearch servers in eqiad have CPU overheating In-Scope Open None
Other Operations 168891 rack/setup/install labtestmetal2001.codfw.wmnet In-Scope Open None
Other Operations 94819 Audit racktables In-Scope Open None
Other Operations 168967 Upload shiny-server .deb to our Jessie apt repository In-Scope Open None
Other Operations 169035 bast3002 sdb broken In-Scope Open None
Other Operations 94329 secure Cassandra/RESTBase cluster In-Scope Open None
Other Operations 169246 Stress/capacity test new ores* cluster Screep Open None
Other Operations 169249 /usr/local/bin/xenon-generate-svgs and flamegraph.pl cronspam In-Scope Open None
Other Operations 169286 labstore1005 A PCIe link training failure error on boot In-Scope Open None
Other Operations 169287 etcd config depends on puppet certs, but puppet doesn't know In-Scope Open None
Other Operations 169290 New anti-stackclash (4.9.25-1~bpo8+3 ) kernel super bad for NFS In-Scope Open None
Other Operations 169318 Use multiple puppetdbs on puppet masters In-Scope Open None
Other Operations 169322 Research whether it makes sense to have OTRS installation in an HA setup In-Scope Open None
Other Operations 94277 Convert snapshot hosts to use HHVM and trusty In-Scope Open None
Other Operations 172710 send wdqs logs to logstash Screep Open None
Other Operations 172712 fix librenms LE check for netmon2001 Screep Open None
Other Operations 169518 Decommission esams ms-fe / ms-be Screep Open None
Other Operations 172735 Create 'pagecompilation' Swift account(s) (beta + prod) for Readers offline article compilations project Screep Open None
Other Operations 169548 Prepare for Puppet 4 Screep Open None
Other Operations 169564 MD RAID: remove mdadm daily check Screep Open None
Other Operations 169570 nfs-manage failover script needs to be tested with real load and fixed Screep Open None
Other Operations 94215 decommission cp3001 & cp3002 In-Scope Open None
Other Operations 93531 secure.wikimedia.org entries still showing up in Google search results In-Scope Open None
Other Operations 93138 Procure hardware for Sentry In-Scope Open None
Other Operations 92471 enable authenticated access to Cassandra JMX In-Scope Open None
Other Operations 169658 Improve database backups' coverage, monitoring and data recovery time (part 1) (tracking) Screep Open None
Other Operations 169680 NFS on dataset1001 overloaded, high load on the hosts that mount it Screep Open None
Other Operations 169763 Upload nodejs 6.x to stretch-wikimedia Screep Open None
Other Operations 169849 Architecture and puppetize setup for dumpsdata boxes Screep Open None
Other Operations 169884 Jobrunners generate mediawiki exceptions upon calling Closure$RecentChange::save Screep Open None
Other Operations 91404 Setup backups of elasticsearch indices In-Scope Open None
Other Operations 172798 allow wdqs-admins to pool / depool wdqs servers Screep Open None
Other Operations 169937 Services Q1 2017/18 goal: Begin migrating job queue processing to multi-DC enabled eventbus infrastructure. Screep Open None
Other Operations 169939 End of August milestone: Cassandra 3 cluster in production Screep Open None
Other Operations 169969 Regularly purge old ores graphite metrics Screep Open None
Other Operations 170108 Operations Q1 goal: Streamlined Service Delivery Screep Open None
Other Operations 170111 Implement a pod networking policy approach Screep Open None
Other Operations 170119 Upgrade to kubernetes >=1.5 Screep Open None
Other Operations 170120 Standardize on the "default" pod setup Screep Open None
Other Operations 170121 Experiment with ingress solutions (stretch) Screep Open None
Other Operations 89887 Clean up permissions for privatedata files on stat1002 - they should be group readable by statistics-privatedata-users Screep Open None
Other Operations 89829 bond eth interfaces on ms1001 In-Scope Open None
Other Operations 170150 Evaluate Grafana's LDAP group options and deprecate grafana-admin if possible Screep Open None
Other Operations 170152 mc2023 / mc2025 fail to mount root partition within 90 seconds using Linux 4.9 Screep Open None
Other Operations 172809 Degraded RAID on analytics1055 Screep Open None
Other Operations 89808 wikitech instances list is blank In-Scope Open None
Other Operations 88997 Improve graphite failover In-Scope Open None
Other Operations 88730 Nutcracker needs to automatically recover from MC failure - rebalancing issues In-Scope Open None
Other Operations 170298 sshd stretch puppet support Screep Open None
Other Operations 170353 Icinga: timeseries checks should have the link to a graph with the data Screep Open None
Other Operations 170365 move legal-tm-vio alias to OIT Screep Open None
Other Operations 87790 decom amslvs1-4 (dc work) In-Scope Open None
Other Operations 170453 FY2017/18 Program 6: Streamlined Service delivery Screep Open None
Other Operations 170456 FY2017/18 Program 6 - Outcome 2 - Objective 3: Integrated, container-based development environment Screep Open None
Other Operations 170474 Decommisson and store old row D network gear. Screep Open None
Other Operations 170480 FY2017/18 Program 6 - Outcome 2: Developers are able to develop and test their applications through a unified pipeline towards production deployment. Screep Open None
Other Operations 170481 FY2017/18 Program 6 - Outcome 2 - Objective 2: Set up a continuous integration and deployment pipeline Screep Open None
Other Operations 172815 Improve stability and maintainability of our browser-based PDF render service Screep Open None
Other Operations 170510 unaccepted salt keys Screep Open None
Other Operations 87220 Minimize differences between beta and production (Tracking) In-Scope Open None
Other Operations 170548 nodejs 6.11 Screep Open None
Other Operations 170628 HTTP 429 on thumbnail images for specific SVG file on Commons Screep Open None
Other Operations 170640 reports.frdev.wm.o -- still in use? Screep Open None
Other Operations 170740 PuppetDB misbehaving on 2017-07-15 Screep Open None
Other Operations 170817 Upgrade Thumbor servers to Stretch Screep Open None
Other Operations 86552 monitor and alarm on SMART attributes In-Scope Open None
Other Operations 86546 graphite-web logs are not rotated In-Scope Open None
Other Operations 172891 Access for new Research Scientist: Diego Saez Screep Open None
Other Operations 170995 Setup a mirror for R language dependencies (CRAN) Screep Open 10.0
Other Operations 86081 Complete the use of HHVM over Zend PHP on the Wikimedia cluster In-Scope Open None
Other Operations 85451 scale graphite deployment (tracking) In-Scope Open None
Other Operations 171048 Eventbus does not handle gracefully changes in DNS recursors Screep Open None
Other Operations 171122 librenms: consider using Distributed Poller with multiple netmon servers Screep Open None
Other Operations 84700 Setup management switch in OE12 In-Scope Open None
Other Operations 171157 Monitor internal CA expirations Screep Open None
Other Operations 171166 Build and push a new hhvm-luasandbox package Screep Open None
Other Operations 84163 Fix CirrusSearch monitoring In-Scope Open None
Other Operations 83729 Fix monitoring of poolcounter service Screep Open None
Other Operations 171188 Move the main WMCS puppetmaster into the Labs realm Screep Open None
Other Operations 171191 Should puppet auto-restart slapd? Screep Open None
Other Operations 171210 rack/setup/install wdqs100[45].eqiad.wmnet Screep Open None
Other Operations 171392 Some Commons pages transcluding Template:Countries_of_Europe HTTP 500/503 when accessed from non-English languages specified in the template Screep Open None
Other Operations 171452 Integrate jessie 8.9 point release Screep Open None
Other Operations 171453 Integrate stretch 9.1 point release Screep Open None
Other Operations 76306 Set warning thresholds for average cluster utilization In-Scope Open None
Other Operations 172921 Nrpe command_timeout and "Service Check Timed Out" errors Screep Open None
Other Operations 171482 Programmatic generation of grafana dashboards Screep Open None
Other Operations 171490 mendelevium (otrs) running out of inodes Screep Open None
Other Operations 76203 Make ircecho run as its own user In-Scope Open None
Other Operations 171584 failing RAID disk on frdb2001 Screep Open None
Other Operations 171619 ORES should use git-fat for binaries Screep Open None
Other Operations 171623 Split up labstore external shelf storage available in codfw between labstore2001 and 2 Screep Open None
Other Operations 171626 rack/setup/install druid100[456].eqiad.wmnet Screep Open None
Other Operations 171704 Switch all hosts to the future parser Screep Open None
Other Operations 171707 Upgrade kartotherian and tilerator to nodejs 6.11 Screep Open None
Other Operations 69015 m.wikipedia.org incorrectly redirects to en.m.wikipedia.org In-Scope Open None
Other Operations 172930 Long running thumbnail requests locking up Thumbor instances Screep Open None
Other Operations 171745 nscd does not cache localhost causing high CPU usage when localhost is often resolved Screep Open None
Other Operations 171758 Simplify git-fat support for pulling from both production and labs Screep Open None
Other Operations 171786 Switch to new labs puppetmasters Screep Open None
Other Operations 67394 [EPIC] Performance testing environment In-Scope Open None
Other Operations 67270 Default license for operations/puppet In-Scope Open None
Other Operations 171851 Reimage ores* hosts with Debian Stretch Screep Open None
Other Operations 171923 thorium - failed git clone of geowiki-data-private Screep Open None
Other Operations 171958 Requesting access to mwlog1001.eqiad.wmnet for goransm Screep Open None
Other Operations 64987 librsvg misinterpret quoted font family names that contain whitespaces In-Scope Open None
Other Operations 56713 Non-NDA users cannot access graphite.wikimedia.org In-Scope Open None
Other Operations 56515 Apply editing rate limits for all users In-Scope Open None
Other Operations 55457 setup a DB backed parser cache In-Scope Open None
Other Operations 50029 crackling at start of OGG renditions of MIDI files (fixed in TiMidity++ 2.14.0) Screep Open None
Other Operations 46791 [[wikitech:Server_admin_log]] should not rely on freenode irc for logmsgbot entries In-Scope Open None
Other Operations 46016 SVG fails to render properly due to several issues In-Scope Open None
Other Operations 172149 Consider a lower virtual node count Screep Open None
Other Operations 40860 security@mediawiki.org : Create a public key and publish it on the public key servers In-Scope Open None
Other Operations 172217 Wikimania needs hosting on a server for onsite conference guide Screep Open None
Other Operations 173056 Import Wiki Loves Monuments photos from Flickr to Commons Screep Open None
Other Operations 36947 Incorrect text positioning in SVG rasterization (scale/transform; font-size; kerning) In-Scope Open 0.0
Other Operations 32716 Run our own Tor client for Tor block In-Scope Open None
Other Operations 17000 Special:Import error: "Import failed: Could not open import file" In-Scope Open None
Other Operations 172479 Collect error logs from jobchron/jobrunner services in Logstash Screep Open None
Other Operations 172538 rack/setup/install labvirt10(19|20).eqiad.wmnet Screep Open None
Other Operations 172602 lists.wikimedia.org (208.80.154.21) blocked by Trend Micro Screep Open None
Other Operations 126281 [Regression] stats.wikipedia.org redirect no longer works ("Domain not served here") In-Scope Open None
Other Operations 126295 Spike: What do we have to package to run the Programs and Events dashboard on production? In-Scope Open None
Other Operations 126574 puppet should try to mount all mountable swift filesystems In-Scope Open None
Other Operations 126619 cassandra slow streaming during (de)commission In-Scope Open None
Other Operations 126989 MediaWiki logging & encryption In-Scope Open None
Other Operations 127054 pinentry-gtk2 pulls in a lot of unneeded Gnome/GTK libs In-Scope Open None
Other Operations 126158 [RFC] Alert about *when* partitions will run out of space, not a percentage/absolute number In-Scope Open None
Other Operations 126083 overhaul labstore setup [tracking] In-Scope Open None
Other Operations 125752 setup/deploy sarin(WMF5851) as a salt master in codfw In-Scope Open None
Other Operations 127549 move travel related aliases to OIT In-Scope Open None
Other Operations 127550 status of studentgroups@ and studentclubs@ mail aliases? In-Scope Open None
Other Operations 125735 Warning: timed out after 0.2 seconds when connecting to rdb1001.eqiad.wmnet [110]: Connection timed out In-Scope Open None
Other Operations 127797 document all puppet classes / defined types!? In-Scope Open None
Other Operations 127825 Re-add intel-microcode In-Scope Open None
Other Operations 125442 es2009 degraded RAID In-Scope Open None
Other Operations 125411 Diamond load averages do not contain scaled versions In-Scope Open None
Other Operations 125085 Split the API MediaWiki appserver pool into two external/internal pools In-Scope Open None
Other Operations 125015 Requests to (hard) redirect pages return their target's contents but are counted as pageviews to the redirect page In-Scope Open None
Other Operations 128590 Cassandra uses default ip address for outbound packets while bootstrapping In-Scope Open None
Other Operations 128615 Get rid of Tool Labs home page check from shinken In-Scope Open None
Other Operations 128715 Add other Tools administrators to the Icinga notification group In-Scope Open None
Other Operations 128716 Make icinga-wm report Tools homepage check at #wikimedia-labs, too In-Scope Open None
Other Operations 129180 Preserve SSH host key when re-imaging hosts In-Scope Open None
Other Operations 129188 mw2212 unresponsive In-Scope Open None
Other Operations 129224 on labcontrol1001, /var/cache/salt has too many files! In-Scope Open None
Other Operations 129621 "internal_api_error_MWException: [dbf916b7] Exception Caught: Could not acquire lock for" for some uploads (during upload with Pywikibot OAuth) In-Scope Open None
Other Operations 124991 evaluate possibility for nscd use with useldap In-Scope Open None
Other Operations 124646 Salt minions randomly crashing when the deployment server grain gets changed In-Scope Open None
Other Operations 129841 Many minions fail to connect to salt master since 10:39 In-Scope Open None
Other Operations 129847 conftool-merge should report which node is setting attributes for In-Scope Open None
Other Operations 129963 Update memcached package and configuration options In-Scope Open None
Other Operations 130209 Collect threaddumps from elasticsearch at regular intervals In-Scope Open None
Other Operations 130512 Grafana: Job Queue Health: Panel is displayed incorrectly In-Scope Open None
Other Operations 130590 Have dedicated master nodes for elasticsearch In-Scope Open None
Other Operations 130593 investigate slapd memory leak In-Scope Open None
Other Operations 130617 Collect metrics on pool counter usage In-Scope Open None
Other Operations 130709 authoritative copy of 'root' files for upload.wikimedia.org is only in swift In-Scope Open None
Other Operations 124413 confctl should provide tags information after writing data In-Scope Open None
Other Operations 131326 smokeping config puppetization issue? In-Scope Open None
Other Operations 131748 Refresh the appservers puppet code/configs In-Scope Open None
Other Operations 131832 Unable to restore file that has a very large file size In-Scope Open None
Other Operations 131928 Upgrade jessie systems from Linux 3.19 to 4.4 In-Scope Open None
Other Operations 124185 Evaluate alternative web interfaces to icinga 1 core In-Scope Open None
Other Operations 131966 Default gateway unreachable on baham.wikimedia.org after reboot In-Scope Open None
Other Operations 132104 Consider moving policy.wikimedia.org away from WordPress.com In-Scope Open None
Other Operations 132216 Setting up bulk proxies pointing to a multiwiki mediawiki-vagrant setup running on a labs vm In-Scope Open None
Other Operations 132256 Analytics hosts showed high temperature alarms In-Scope Open None
Other Operations 132324 Tracking and Reducing cron-spam from root@ In-Scope Open None
Other Operations 132325 Weak digest algorithm (SHA1) used to sign InRelease on apt.wikimedia.org In-Scope Open None
Other Operations 124179 Improve access to and control over incident and metrics monitoring infrastructure In-Scope Open None
Other Operations 132532 rsync module doesnt work on trusty In-Scope Open None
Other Operations 124101 Specific revisions of multiple files missing from Swift - 404 Not Found returned In-Scope Open None
Other Operations 132632 puppetize turning off reserved space for cassandra /srv Screep Open None
Other Operations 132856 Write documentation on how / when to use custom Diamond metrics collectors In-Scope Open None
Other Operations 123918 'swift' user/group IDs should be consistent across the fleet In-Scope Open None
Other Operations 133091 Highest SSTables / read thresholds In-Scope Open None
Other Operations 133093 Investigate idle appservers in codfw In-Scope Open None
Other Operations 123818 setup YubiHSM and laptop at office In-Scope Open None
Other Operations 133164 Document eqiad/codfw transition plan for OCG In-Scope Open None
Other Operations 133179 Redis monitoring needs to be improved In-Scope Open None
Other Operations 133318 High levels of PoolCounter errors should trigger alerts In-Scope Open None
Other Operations 123809 Module uwsgi doesn't allow passing multiple config params of same name In-Scope Open None
Other Operations 133392 save grafana dashboards in revision control / puppet In-Scope Open None
Other Operations 123560 investigate rsync between dcs with encryption In-Scope Open None
Other Operations 133476 Proposal: Centralize OTRS login methodology In-Scope Open None
Other Operations 123276 https://test.wikipedia.org/wiki/Bug%3F?action=history doesn't show the history page, unlike https://test.wikipedia.org/w/index.php?title=Bug%3F&action=history In-Scope Open None
Other Operations 133643 Upstream our Diamond PowerDNSRecursorCollector In-Scope Open None
Other Operations 133656 Have a paging check for Nova API accessible In-Scope Open None
Other Operations 133674 HHVM is leaking memory on the API appservers In-Scope Open None
Other Operations 123237 Provide production jessie image with node 4.2; use this for service-runner build command In-Scope Open None
Other Operations 133744 Epic: switch Maps to production status In-Scope Open None
Other Operations 133844 Improve Elasticsearch icinga alerting In-Scope Open None
Other Operations 123106 PNG thumbnail preview of SVG misses some text In-Scope Open None
Other Operations 133913 Completely port l10nupdate to scap In-Scope Open None
Other Operations 134237 Graphoid returns a 400 on MW API time-out In-Scope Open None
Other Operations 134271 Replace ircd-ratbox with something newer/maintained In-Scope Open None
Other Operations 134326 udpmxircecho should write stats of messages processed and we should alert when that drops to zero In-Scope Open None
Other Operations 122917 Provide a good download service of dumps from Wikimedia In-Scope Open None
Other Operations 122825 Service Ownership and Maintenance In-Scope Open None
Other Operations 134458 status.wikimedia.org should use some Wikimedia favicon if possible In-Scope Open None
Other Operations 134551 Create functional cluster checks for all services (and have them page!) In-Scope Open None
Other Operations 134811 Consider REST with SSL (HyperSwitch/Cassandra) for session storage In-Scope Open None
Other Operations 134875 udpmxircecho spam/not working if unable to connect to irc server In-Scope Open None
Other Operations 122210 Security audit for tftp on Carbon In-Scope Open None
Other Operations 135113 Rationalize our jobqueues redis topology In-Scope Open None
Other Operations 135122 Reduce etcd technical debt In-Scope Open None
Other Operations 135124 Deploy etcddump (or another etcd dump & load tool) to production In-Scope Open None
Other Operations 135125 Install a second etcd cluster in codfw In-Scope Open None
Other Operations 135128 Turn on etcd TLS for intra-cluster communications In-Scope Open None
Other Operations 135318 Document how to handle 'inconsistent state within the internal storage backends' issues In-Scope Open None
Other Operations 135338 On Trusty and Jessie PHP yields: PHP Deprecated: Comments starting with '#' are deprecated in /etc/php5/cli/conf.d/20-xhprof.ini on line 2 In-Scope Open None
Other Operations 135385 investigate carbon-c-relay stalls/drops towards graphite2002 In-Scope Open None
Other Operations 135595 mod_deflate + mod_uwsgi causing mangled apache responses In-Scope Open None
Other Operations 135723 Restarts of ganglia-monitor are unreliable In-Scope Open None
Other Operations 122144 Move most (all?) exim personal aliases to OIT In-Scope Open None
Other Operations 135991 Automated service restarts for common low-level system services In-Scope Open None
Other Operations 136094 Race condition in setting net.netfilter.nf_conntrack_tcp_timeout_time_wait In-Scope Open None
Other Operations 136311 Monitor the BMC's event log for hardware errors In-Scope Open None
Other Operations 136312 encrypt syslog traffic In-Scope Open None
Other Operations 136403 Move cp3030+ from OE14 to OE13 in racktables In-Scope Open None
Other Operations 136429 [EPIC] Migrate base image to Debian Jessie In-Scope Open None
Other Operations 136562 Audit/fix hosts with no RAID configured In-Scope Open None
Other Operations 136603 Update limit.sh to support systemd-based cgroup management In-Scope Open None
Other Operations 136702 Increase time before alert for elasticsearch disk space issues In-Scope Open None
Other Operations 122069 jobrunner memory leaks In-Scope Open None
Other Operations 121610 system users with UIDs > 500 In-Scope Open None
Other Operations 137176 catch-all apache vhost on the cluster should return 404 for non-existing sites In-Scope Open None
Other Operations 137217 Clean up apt:pin of python modules used for Nodepool In-Scope Open None
Other Operations 137229 Tune thread for osm2pgsql / postgres max connections for Maps In-Scope Open None
Other Operations 121583 setup/deploy tegmen/WMF6381 as monitoring host In-Scope Open None
Other Operations 137397 revisit swift (sys)logging In-Scope Open None
Other Operations 137616 Epic: cultivating the Maps garden In-Scope Open None
Other Operations 137791 libcglib3-java replaces libcglib-java in Jessie In-Scope Open None
Other Operations 137939 Increase frequency of OSM replication In-Scope Open None
Other Operations 121240 Network isolation for production and semi-production services In-Scope Open None
Other Operations 121105 Mails from MediaWiki seem to get (partially) lost In-Scope Open None
Other Operations 138017 Improve automation around Maps servers In-Scope Open None
Other Operations 120943 Wikimania 2017 site does not automatically redirect to mobile site, when opening from a mobile device Screep Open None
Other Operations 138136 MB Lateefi Fonts for Sindhi Wikipedia. In-Scope Open None
Other Operations 138496 bring swift eqiad to one zone per row In-Scope Open None
Other Operations 120856 Remove all out of warranty unused cp10xx's from A2 In-Scope Open None
Other Operations 138685 notebook1001 shown as DOWN in icinga, due to firewall rules In-Scope Open None
Other Operations 138758 diamond: certain counters always calculated as 0 In-Scope Open None
Other Operations 138799 Create a simple puppet role for setting up a singlenode kubernetes install In-Scope Open None
Other Operations 138821 extend existing graphite whisper files retention to five years In-Scope Open None
Other Operations 138866 Update & standardize Platform-specific_documentation for HP servers In-Scope Open None
Other Operations 139971 access_new_install role vs. Labs vs. the future In-Scope Open None
Other Operations 140075 investigate swift used space spikes since June 2016 In-Scope Open None
Other Operations 140141 Install mscorefonts on scaling servers for SVG rendering In-Scope Open None
Other Operations 140270 Determine a core set or a checklist of permissions for deployment purpose In-Scope Open None
Other Operations 140316 Add granularity limiter (g=) to wikimedia.org DKIM record(s) In-Scope Open None
Other Operations 140442 reinstall rdb100[56] with RAID In-Scope Open None
Other Operations 140536 Thumbnails of some specific images show unwanted black lines In-Scope Open None
Other Operations 140594 svn.wikimedia.org redirects to Diffusion main page, hence hard to find e.g. "flexbisonparse" In-Scope Open None
Other Operations 140813 Protect sensitive user-related information with a UserData / auth / session service In-Scope Open None
Other Operations 140879 503 error raises again while trying to load a Wikidata page In-Scope Open None
Other Operations 140942 Tracking: Monitoring and alerts for "business" metrics In-Scope Open None
Other Operations 141038 implement icinga paging for non-ops teams In-Scope Open None
Other Operations 141128 determine/process/document bios firmware tracking/updating policies In-Scope Open None
Other Operations 141186 Map caches metrics look broken In-Scope Open None
Other Operations 120585 Make l10nupdate user a system user In-Scope Open None
Other Operations 120532 Use user-specific passwords for accessing EventLogging database In-Scope Open None
Other Operations 141520 "MediaWiki exceptions and fatals per minute" alarm is too slow (half an hour delay!) In-Scope Open None
Other Operations 141524 eventbus should send statsd in batches In-Scope Open None
Other Operations 141704 Storage backend errors on commons when deleting/restoring pages In-Scope Open None
Other Operations 141756 audit / test / upgrade hp smartarray P840 firmware In-Scope Open None
Other Operations 141783 Add monitoring for detecting when logstash services are down In-Scope Open None
Other Operations 141897 Review new service 'pre-deployment to production' checklist In-Scope Open None
Other Operations 141959 Moving network::external to hiera broke much of labs In-Scope Open None
Other Operations 142002 Clean up puppet & configs for ORES In-Scope Open None
Other Operations 142205 use granularity (g=) restrictions for wikimedia.org fundraising DKIM records In-Scope Open None
Other Operations 142815 Enhance account handling (meta bug) In-Scope Open None
Other Operations 142821 Synchronise groups defined in data.yaml to LDAP In-Scope Open None
Other Operations 142827 Enforce reference to Phabricator task for all commits to modules/admin/data/data.yaml In-Scope Open None
Other Operations 142984 Review lists of config/sysctl recommendations by "kernel self-protection project" In-Scope Open None
Other Operations 142991 Enable "upload by url" feature at zhwiki In-Scope Open None
Other Operations 143552 Make elasticsearch configuration more robust to loss of network connectivity In-Scope Open None
Other Operations 143556 Setting up grafana should also setup Anonymous read-only access for the default org In-Scope Open None
Other Operations 120377 labmon1001 graphite instance archiver keeps archiving the same instances In-Scope Open None
Other Operations 143931 Update ICU version to 55.1 In-Scope Open None
Other Operations 144006 Move the MW Beta appservers to Debian In-Scope Open None
Other Operations 120165 Implement role based hiera lookups for labs In-Scope Open None
Other Operations 120159 Phase out the 'puppet' module with fire, make self hosted puppetmasters use the puppetmaster module In-Scope Open None
Other Operations 144431 RESTBase k-r-v as Cassandra anti-pattern In-Scope Open None
Other Operations 144479 Ensure thumbor container access is preserved by mw filebackend setzoneaccess In-Scope Open None
Other Operations 144539 Remove /srv/deployment/wdqs/wdqs/rules.log symlink In-Scope Open None
Other Operations 119846 Redirect revisions from svn.wikimedia.org to https://phabricator.wikimedia.org/rSVN In-Scope Open None
Other Operations 144933 Cleanup debconf handling in mailman puppet setup In-Scope Open None
Other Operations 145065 Decrease time required to fully restart the Cirrus elasticsearch clusters In-Scope Open None
Other Operations 145293 E-mails not being received by OTRS In-Scope Open None
Other Operations 145659 Port application-specific metrics from ganglia to prometheus In-Scope Open None
Other Operations 119719 Enforce a minimum refresh period for grafana dashboards hitting graphite In-Scope Open None
Other Operations 145742 Migrate video scalers to jessie In-Scope Open None
Other Operations 146090 High failure rate of account creation should trigger an alarm / page people In-Scope Open None
Other Operations 146285 Switch mwscript from Zend PHP5 to default php alternative (egHHVM) In-Scope Open None
Other Operations 119718 Make it easier to ban misbehaving dashboards from graphite In-Scope Open None
Other Operations 146355 Replace etcd internal auth mechanism with a frontend proxy In-Scope Open None
Other Operations 119679 Rewrite http://download.wikimedia.org/mediawiki/ -> https://releases.wikimedia.org/mediawiki in less than 3 redirects In-Scope Open None
Other Operations 119660 Set up LVS for labs dns recursors In-Scope Open None
Other Operations 146627 Make deployment-prep puppetmaster more similar to Production puppetmaster In-Scope Open None
Other Operations 146657 create notifications about user accounts that have not been used for a long time In-Scope Open None
Other Operations 146664 Limit resources used by ORES In-Scope Open None
Other Operations 146841 Reach out to Google about @yahoo.com emails not reaching gmail inboxes (when sent to mailing lists) In-Scope Open None
Other Operations 146914 grain-ensure erroneous mismatch with (bool)True vs (str)true In-Scope Open None
Other Operations 146968 OTRS spam classification methods and systems In-Scope Open None
Other Operations 147040 Two recently uploaded files have disappeared (404) In-Scope Open None
Other Operations 119401 Untangle labs/production roles from labs/instance roles In-Scope Open None
Other Operations 147204 Update confd package In-Scope Open None
Other Operations 147366 Setup automated topk wide row reporting Screep Open None
Other Operations 147872 Rename rhodium to puppetmaster1003 In-Scope Open None
Other Operations 147905 investigate lead hardware issue In-Scope Open None
Other Operations 147923 Extract metrics from logs In-Scope Open None
Other Operations 148017 lvs2002 repeated usb connect/disconnect message In-Scope Open None
Other Operations 148048 Store Wikimedia unified account name (SUL) in LDAP directory In-Scope Open None
Other Operations 148061 Feasibility of hosting podcast setup on Wikimedia servers In-Scope Open None
Other Operations 119274 Check incoming requests to secure.wm.o In-Scope Open None
Other Operations 118829 Automate the provisioning and management of MediaWiki clusters In-Scope Open None
Other Operations 118812 Investigate mysterious_sysctl settings and figure out what to do with them In-Scope Open None
Other Operations 148567 Restrict outgoing network connections from Electron render service In-Scope Open None
Other Operations 148614 Icinga check for Tor In-Scope Open None
Other Operations 148637 Port redis statistics from ganglia to prometheus In-Scope Open None
Other Operations 148647 refresh swift hardware in codfw/eqiad In-Scope Open None
Other Operations 148693 Deploy IDS rendering engine to production In-Scope Open None
Other Operations 148843 GPU upgrade for stats machine In-Scope Open None
Other Operations 148968 Build Kubernetes for production use In-Scope Open None
Other Operations 118746 Goal: Strengthen Incident monitoring infrastructure In-Scope Open None
Other Operations 118380 slow salt-call invocation on minions In-Scope Open None
Other Operations 148986 Firewall sets not being loaded post-reboot due to a @resolve race on jessie In-Scope Open None
Other Operations 149057 Designate seems very slow to delete records? In-Scope Open None
Other Operations 149180 Trebuchet targets for test/testrepo are out of date In-Scope Open None
Other Operations 149287 Heating alerts for mw servers in eqiad In-Scope Open None
Other Operations 149421 Long running mediawiki web requests impacts service availability, specially databases In-Scope Open None
Other Operations 149543 Setup PAWS internal experimentally on notebook* nodes In-Scope Open None
Other Operations 149589 Puppet tab in Horizon unusably slow In-Scope Open None
Other Operations 149617 Integrating MediaWiki (and other services) with dynamic configuration In-Scope Open None
Other Operations 149804 Review of ferm services without srange In-Scope Open None
Other Operations 149845 Something is wrong with installer root disk stuff In-Scope Open None
Other Operations 118331 Alert when used_memory gets too high for redis queues In-Scope Open None
Other Operations 118154 determine hardware needs for dumps in eqiad and codfw In-Scope Open None
Other Operations 149885 Investigate Swift as a storage backend for maps tiles In-Scope Open None
Other Operations 150020 Refactor puppet-postgresql module to use custom types In-Scope Open None
Other Operations 117673 labs precise and jessie instance not accessible after provisioning In-Scope Open None
Other Operations 150108 fix partition scheme for logstash ingester hosts In-Scope Open None
Other Operations 150185 Deploy ElectronPdfService Extension to production In-Scope Open None
Other Operations 150206 ms-be1016 controller cache failure In-Scope Open None
Other Operations 117508 Make ops-l a list for humans again (no cheating) In-Scope Open None
Other Operations 116951 Reprepro should bail if it can't read and sign using the root keys In-Scope Open None
Other Operations 150300 icinga notification if elevated writing to badpass.log In-Scope Open None
Other Operations 150356 Wikidata Query Service is overly verbose toward logstash In-Scope Open None
Other Operations 150396 Phabricator leaving old files in /tmp In-Scope Open None
Other Operations 150460 Configure maps cluster to send statsd metrics to the statsd endpoint in the same datacenter In-Scope Open None
Other Operations 150466 publish kartotherian / tilerator metrics by cluster In-Scope Open None
Other Operations 116805 DomainKeys Identified Mail (DKIM) for phabricator.wikimedia.org In-Scope Open None
Other Operations 150486 Deploy federation for Prometheus In-Scope Open None
Other Operations 150532 Upgrade qemu on ganeti clusters to 2.7 In-Scope Open None
Other Operations 150651 Information missing from racktables In-Scope Open None
Other Operations 150672 Provide a /parsoid directory on releases.wikimedia.org In-Scope Open None
Other Operations 116767 limit the impact of heavy/large graphite queries In-Scope Open None
Other Operations 150771 Secondary production Jenkins for CI In-Scope Open None
Other Operations 150811 Evaluate ScyllaDB as a near-term replacement to Cassandra In-Scope Open None
Other Operations 150822 Internal PKI for secure communication - Barcelona Ops offsite 2016 In-Scope Open None
Other Operations 150823 Puppet CA rollover In-Scope Open None
Other Operations 150871 [EPIC] (Proposal) Replicate core OCG features and sunset OCG service In-Scope Open None
Other Operations 150872 Replace OCG in collection extension with Electron In-Scope Open None
Other Operations 150874 Collate wikimedia pages into a single html wikimedia page that can then be rendered into a single pdf In-Scope Open None
Other Operations 150875 Confirm attribution needs In-Scope Open None
Other Operations 150912 Class 'Memcached' not found when running mwscript eval.php on debug servers In-Scope Open None
Other Operations 150917 Remove deprecated features from book creator UI In-Scope Open None
Other Operations 151009 Provide authenticated access to Prometheus native web interface In-Scope Open None
Other Operations 151045 Extending Yubico 2FA for production use (meta bug) In-Scope Open None
Other Operations 151046 Fully puppetise yubikey-val In-Scope Open None
Other Operations 151047 Integrate Yubikey into data.yaml In-Scope Open None
Other Operations 151048 Icinga monitoring for Yubikey components In-Scope Open None
Other Operations 151049 Run systematic availability tests In-Scope Open None
Other Operations 151050 Proper documentation for Yubico 2FA for production use In-Scope Open None
Other Operations 151273 lvs4002 power supply failure In-Scope Open None
Other Operations 151275 cp4008 and cp4012 running on single PSU In-Scope Open None
Other Operations 151304 tmpreaper possible race condition In-Scope Open None
Other Operations 151310 create-dbusers service failing on labstore1004 In-Scope Open None
Other Operations 151314 logrotate failing with $FILE.1.gz: File exists In-Scope Open None
Other Operations 151317 stat user crontab on stat hosts for old file removal In-Scope Open None
Other Operations 151322 labstore systemd state Icinga alarms In-Scope Open None
Other Operations 151486 Silver anomalies In-Scope Open None
Other Operations 151489 silver: /dev/md2 mounted twice In-Scope Open None
Other Operations 151493 silver: / partition low on space In-Scope Open None
Other Operations 151554 Track incoming HTTP request count on the Thumbor boxes In-Scope Open None
Other Operations 151632 Fix Icinga checks for test/decom servers In-Scope Open None
Other Operations 116750 2FA for SSH access to the production cluster In-Scope Open None
Other Operations 151702 API cluster failure / OOM In-Scope Open None
Other Operations 152073 Check concurrency/retry/timeout limits and syncronize those between services In-Scope Open None
Other Operations 152078 Load balancing "external" traffic to the Kubernetes cluster in production In-Scope Open None