Conversation

Notices

Embed this notice
Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:47 JST Bianca Kastl

#Crowdstrike huh? Was lernen wir daraus?
Gucken wir uns erst mal den sehr guten englischen Wikipedia Artikel an, um zu wissen, was da falsch lief… https://en.wikipedia.org/wiki/2024_CrowdStrike_incident
In conversation about 2 months ago from mastodon.social permalink
Attachments
1. Domain not in remote thumbnail source whitelist: upload.wikimedia.org
  
  2024 CrowdStrike incident
  
  On 19 July 2024, a faulty update to security software produced by CrowdStrike, an American cybersecurity company, caused widespread problems as computers and virtual machines running Microsoft Windows crashed and were unable to properly restart. Businesses and governments around the globe were affected in what has been described as the largest outage in the history of information technology and "historic in scale". Among the disrupted industries were airlines, airports, banks, hotels, hospitals, manufacturing, stock markets, and broadcasting. Governmental services, such as emergency services, and websites, were also affected. The error was discovered and a fix was produced on the same day but requires manual application to affected systems. This has resulted in a slow rate of recovery and lingering outages on many services. CrowdStrike produces a suite of security software products for businesses, designed to protect computers from cyberattacks. The Falcon Sensor product installs an endpoint sensor at the operating system level on individual computers...
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:42 JST Bianca Kastl
  in reply to
  
  Es wird immer besser. Oder Hahnebüchen.
  https://www.heise.de/hintergrund/Fataler-Fehler-bei-CrowdStrike-Schuld-war-ein-Null-Pointer-9807896.html?
  Also: Ein Anbieter von Sicherheitslösungen für Millionen Kunden pusht einen Speicheradressierungsfehler in C++ (jetzt keine Diskussion ob das sinnvoll ist, von der Sprache her) weltweit.
  Das ist eigentlich nahezu unmöglich, dass das passiert, also angenommen, dass dein Unternehmen sowas wie ne mehrstufige Qualitätssicherung hat.
  Scheint Crowdstrike aber halt null pointer drauf zu kennen.
  
  In conversation about 2 months ago permalink
  
  GreenSkyOverMe (Monika) repeated this.
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:42 JST Bianca Kastl
  in reply to
  
  Versicherungen so: Ja hm, also das ist so ein Thema, da müssten wir mal gucken, aber bei Cyberversicherungen, eher nein. https://www.golem.de/news/laut-gdv-crowdstrike-schaeden-nicht-vom-versicherungsschutz-abgedeckt-2407-187267.html
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: www.golem.de
    
    Golem.de: IT-News für Profis
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:43 JST Bianca Kastl
  in reply to
  
  Das führt aber auch zu einer veränderten Architektur von EDR-Funktionen:
  - Mehr Zwiebelschalen um Endpoints herum und nicht auf Endpoints (Firewalls, Malwarescanner auf Netzwerk Storage)
  - Mehr Anwendungen in vollständig kontrollierbaren Umgebungen (der Server, die Cloud, die einfacher nach XYZ definierbar sind)
  - generell eher weniger mächtiger Endpunkte
  Das ist okay, weil auf den Nicht-Endpunkten ist mehr Performance und mehr
  beherrschbare Kette von Security-Maßnahmen. Bessere Experience.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:43 JST Bianca Kastl
  in reply to
  
  Klar, das geht eher nur im Unternehmenskontext, aber was hat Crowdstrike gerade zerballert: Unternehmen. qed.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:43 JST Bianca Kastl
  in reply to
  
  Irgendwie hat der Thread jetzt zumindest einen endenden Faden und naja, irgendwann ist auch Wochenende.
  Grüße an alle IT-Leute, die den Schlamassel ausbaden müssen. Auf Euch.
  (Und ganz und gar nicht auf die Security Snakeoil Companies)
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:43 JST Bianca Kastl
  in reply to
  
  Lese gerade, dass die Triebfeder des CEO von Crowdstrike "das ist alles zu langsam" war.
  Ja okay, aber wenn dir alles zu langsam ist, musst du sehr schnell zu einem last known good Zustand zurückkommen, wenn es scheitert.
  Und nicht in dem Menschen händisch an Devices rumdoktoren. Herrje. Du musst beides haben, sonst geht schnell nicht. https://en.wikipedia.org/wiki/George_Kurtz
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: upload.wikimedia.org
    
    George Kurtz
    
    George Kurtz is an American businessman. He is the co-founder and CEO of cybersecurity company CrowdStrike together with Dmitri Alperovitch. He was also the founder of Foundstone and chief technology officer of McAfee. He authored Hacking Exposed: Network Security Secrets & Solutions. He is also a racing driver. Kurtz grew up in Parsippany-Troy Hills, New Jersey, and attended Parsippany High School. He claims that he started programming video games on his Commodore when he was in fourth grade. He went on to build bulletin board systems in high school. He graduated from Seton Hall University with a degree in accounting. After college, Kurtz began his career at Price Waterhouse as a CPA. In 1993, Price Waterhouse made Kurtz one of its first employees in its new security group. In 1999, he co-wrote Hacking Exposed, a book about cybersecurity for network administrators, with Stuart McClure and Joel Scambray. The book sold more than 600,000 copies and was translated into more than 30 languages. Later that year he started a cybersecurity company, Foundstone, one of the first dedicated security consulting companies...
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:44 JST Bianca Kastl
  in reply to
  
  Ich will jetzt nicht die zurecht gebrachten Snake Oil Analogien zu Security Glitzer Lösungen bringen, sondern eher nur sagen, dass die Teil eines Grundproblems sind:
  Die Teile verkaufen sich, weil sie das Gefühl vermitteln, dass Security ein Produkt sei, das ich einmal kaufe und gut.
  Hier ein Packen Security als Anwendung wie praktisch. Macht alles für dich, musste dich auch nicht mehr ärgern.
  Und ist auch gut für Compliance.
  So leider das funktionierende Marketing.
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    http://Compliance.So/
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:44 JST Bianca Kastl
  in reply to
  
  *Cocktail Pause, der Tag war hart*
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:44 JST Bianca Kastl
  in reply to
  
  So, zurück zu EDR Systemen.
  Bei denen liegt oftmals eine sehr falsche Vorstellung zugrunde: Dass es überhaupt möglich ist, in einem System auf einem Device alles Evil dieser Cyberwelt abzuhalten. Nö geht nicht.
  Security ist mehr oder weniger ein Prozess von mehreren Ebenen, die scheitern können, bei denen eine allein nie zum vollständigen Zusammenbruch deiner Security oder deines Systems / Business etc. führen sollte.
  Aber bei Crowdstrike ist genau das passiert.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:44 JST Bianca Kastl
  in reply to
  
  Gucken wir mal auf die User-Perspektive: Ein System, das im Hintergrund Ressourcen frisst, vom Arbeiten abhält und immer dazwischen funkt, tut Security einen Bärendienst.
  Du musst eher zumindest als Unternehmen ausgehen, dass du deine User und dessen Device nicht komplett als Risiko eindämmen solltest, weil sie sonst Methoden gegen das System finden, um arbeiten zu können.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:45 JST Bianca Kastl
  in reply to
  
  Nur deutet die Historie des CEO von Crowdstrike eher auf eine Kultur von "Ah egal, mach mal großes Update" hinaus https://www.golem.de/news/crowdstrike-ceo-nicht-zum-ersten-mal-ganze-unternehmensgruppen-lahmgelegt-2407-187257.html
  Schon mal ein großes Update versemmelt und 2024 noch mal getoppt. Toller Track Record (ironisch).
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: www.golem.de
    
    Golem.de: IT-News für Profis
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:45 JST Bianca Kastl
  in reply to
  
  Also erster möglicher Fehler: Falsche Updatekultur beim Dienstleister.
  Auch wenn du dir mit deinen Updates relativ sicher bist und auch wenn du ne edgy Security Good Cop Programmerbude bist: Der Teufel in der IT ist eine Kette von unvorhergesehenen Problemen für die du vielleicht nichts kannst.
  Daher ist es vollkommen okay, wenn du erst mal bei nicht kritischen Updates guckst, ob von deinen x tausend Instanzen nach Update und Reboot etc. die noch laufen und du dann erst die anderen Updatest.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:45 JST Bianca Kastl
  in reply to
  
  Das kann tatsächlich ne Art von Design Goal bei IT-Prozessen sein: Schutz vor totaler Verkonfigurierung oder Update-Zerstörung durch nur eine Änderung.
  Was hier tatsächlich genau so eingetreten ist.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:45 JST Bianca Kastl
  in reply to
  
  Das zweite Problem betrifft Systeme wie Crowdstrike Falcon.
  Also sagen wir mal EDR-Systeme im Allgemeinen (Endpoint Detection and Response).
  Die haben oft (eigentlich immer) das Ansinnen als eine Art Superheld Systeme zu schützen.
  Und deshalb brauchen die auch immer ganz ganz viele Systemprivilegien.
  Überhebliches und gefährliches Gehabe (siehe Updatekultur).
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:46 JST Bianca Kastl
  in reply to
  
  Spannend finde ich die Timeline hier mal als Ausgangslage:
  04:09 UTC Release fehlerhaftes Update im Release Channel
  05:27 UTC Update wird zurückgezogen
  Das ist es echt nicht viel Zeit. Eigentlich
  In conversation about 2 months ago permalink
  Attachments
  1. At 04:09 (UTC) Windows virtual machines on Azure began rebooting and crashing, and at 06:48 Google Compute Engine also reported the issue. At 07:15, Google identified it was the CrowdStrike update at fault. CrowdStrike CEO George Kurtz confirmed that a faulty driver update released by CrowdStrike caused the outage and confirmed that it was not a cyberattack.[l[3! Technical details An update to a configuration file (here called a channel file) issued at 04:09 UTC on 19 July 2024 conflicted with the Windows sensor client, causing affected machines to enter a blue screen of death with the stop code PAGE_FAULT_IN_NONPAGED_AREA . This left machines stuck in a boot loop or in recovery mode.Hosts running Windows 7 or Windows Server 2008 R2 are not affected. Remedy Affected machines could be restored by booting into safe mode or the Windows Recovery Environment and deleting the %windirs \System32\drivers\CrowdStrike\C-00000291x.sys files.As this process must be done locally on each individual machine, it may take days for affected businesses to restore all systems. CrowdStrike reverted the content update at 05:27 UTC, and devices booted after the reversal are not impacted. At 09:45 UTC, CEO George Kurtz confirmed that the fix was deployed.
    https://files.mastodon.social/media_attachments/files/112/814/544/854/577/770/original/499f68cc484042e0.png
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:46 JST Bianca Kastl
  in reply to
  
  Wir merken an der Timeline des Wikipedia Artikels allein, dass hier sehr schnell Updates eine Kaskade von Problemen auslösen. Nein, das Azure Problem ist eine von Crowd Strike VMs, keines von Azure https://azure.status.microsoft/en-gb/status
  Wir haben aber in der Timeline 1 Stunde 18 Minuten als Zeit, in der ein Update verteilt werden konnte, das den ganzen Schlamassel auslösen konnte.
  Das ist… lasst uns mal gucken, ob das viel oder wenig Zeit ist für ein Update.
  In conversation about 2 months ago permalink
  Attachments
  1. Untitled attachment
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:46 JST Bianca Kastl
  in reply to
  
  Erst mal die relativ einfache Frage: Was rechtfertigt ein weltweites Update an alle meine Endpunkte ohne Verfahren im Rolling Update*:
  1. Es brennt wirklich
  2. Eigentlich nichts
  *Rolling update ist ein Verfahren, bei dem Instanzen schrittweise geupdated werden. Dabei nimmt man schrittweise ein paar Prozent von upzudatenden Instanzen auf eine neue Version und schaut, ob die nicht explodieren. Wenn dann die ersten 5 % meinetwegen okay waren, kommt so langsam der Rest nach bis 100%.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bianca Kastl (bkastl@mastodon.social)'s status on Sunday, 21-Jul-2024 00:06:46 JST Bianca Kastl
  in reply to
  
  Klar, so ein Rolling Update Verfahren kann manchmal ein paar Tage dauern, bis alles aktuell ist.
  Aber: Es gibt Updates, bei denen das vollkommen okay ist. Google macht das z.B. bei großen Android Updates sehr erfolgreich.
  Ausnahme: Es ist so ein Zero Day Fix Update, da ist ein Rolling Update eher zu langsam.
  Was herauszufinden ist: Wie kritisch das Update von Crowdstrike war. War es kein Hotfix Zero Day irgendwas Update, war das Updateverfahren eher fahrlässig.
  
  In conversation about 2 months ago permalink

Public

Conversation

Notices

Feeds