Guess where I am. https://netpreserve.org/ga2024/
Conversation
Notices
-
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:05:37 JST Stéphane Bortzmeyer -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:05:34 JST Stéphane Bortzmeyer 🇳🇱 🇨🇭
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:05:36 JST Stéphane Bortzmeyer Poster sessi:on at #WAC. I learn that there are two countries in Europe *without* a legal deposit of the Web. I let you guess their names.
-
Embed this notice
Erwan 🚄 (r1rail@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:06:18 JST Erwan 🚄 @bortzmeyer BFM (pas la télé)
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Haelwenn /элвэн/ :triskell: (lanodan@queer.hacktivis.me)'s status on Friday, 26-Apr-2024 17:07:03 JST Haelwenn /элвэн/ :triskell: @bortzmeyer Closed Garden :D -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:12:33 JST Stéphane Bortzmeyer Facebook closed its API long ago, pretending it was because of Cambridge Analytica (but the real reason was commecial), shutting down many research projects on social media.
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:12:34 JST Stéphane Bortzmeyer Now, panel at #WAC "Archiving Social Media In An Age of APIcalypse" (Twitter closed its API).
[Wondering how to archive the fediverse.]
-
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:12:35 JST Stéphane Bortzmeyer Poster session at #WAC: I loved the "(most|least) cool club", a club of people managing Web crawlers and sharing (unfortunately with JIra and Confluence) regexps of things to exclude from the crawl (loop traps and things like that).
-
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:53:53 JST Stéphane Bortzmeyer TikTok has an API but its use requires that all research papers where it was used have to be pre-approved by TikTok!
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:53:54 JST Stéphane Bortzmeyer Jérôme Thièvre (INA) is the first to mention Bluesky (which apparently has a working API but little content).
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 17:53:56 JST Stéphane Bortzmeyer Several speakers in the panel do not follow the title: they talk about what they did *before*, when API use was possible.
Anat Ben-David, on the contrary, explains what could be done in the future. APIs were not so good, after all (for instance, you are never sure of what they hide). -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 19:04:27 JST Stéphane Bortzmeyer Argh, Mastodon butchers the PWID :-(
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 19:04:28 JST Stéphane Bortzmeyer Now, back to identifiers, at #WAC "The Potentials and Challenges for Researchers and Web Archives Using the Persistent Web IDentifier (PWID)" (The speakers even have T-shirts branded "PWID".)
(The english-speaking Wikipedia page on PWID is not the expected one.)
An example of PWID:
urn:pwid:archive.org:2016-01-22T10:08:23Z:page:https://www.dr.dk(from the Internet-Draft)
-
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 19:04:31 JST Stéphane Bortzmeyer In the end, back to traditional Web scraping, and data donations.
(Medialab SciencesPo currently crawls and scraps #Doctissimo.) -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 21:23:00 JST Stéphane Bortzmeyer If I understand correctly, the goal is to put Internet Archive on IPFS (as WARC files).
But IPNS (IPFS naming system) has limits. Hence the new type, the IPARO, a list of IPFS names pointing to the various versions of an archived Web page.
Plus links going directlty to some points in the list (for resiliency, since IPFS does not guarantee persistence.)
Haelwenn /элвэн/ :triskell: likes this. -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 21:23:01 JST Stéphane Bortzmeyer "Decentralized Web Archiving and Replay via InterPlanetary Archival Record Object (IPARO)" is an attractive title.
-
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 21:23:02 JST Stéphane Bortzmeyer Now, "bit preservation" (presevving bits for the long term, not taking semantics into account).
* several copies (and no SPOF)
* check them -
Embed this notice
Stéphane Bortzmeyer (bortzmeyer@mastodon.gougere.fr)'s status on Friday, 26-Apr-2024 21:23:03 JST Stéphane Bortzmeyer The Internet-Draft is expired and there is no plan to revive it. The only specification for #PWID seems to be https://www.iana.org/assignments/urn-formal/pwid
-
Embed this notice