GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Embed Notice

HTML Code

Corresponding Notice

  1. Embed this notice
    Paul Cantrell (inthehands@hachyderm.io)'s status on Monday, 09-Oct-2023 03:07:44 JSTPaul CantrellPaul Cantrell
    in reply to

    Implemented in Swift, w/Swift concurrency + Metal.

    Video has 1 billion sample points per video frame (i.e. randomly apply transforms 1 billion times, count how many times point lands in each pixel).

    Started with single-threaded CPU version: ~50 megapoints/sec on my 2019 Intel MBP.

    Parallel versions (all CPU, CPU/GPU mix) achieved only ~1.5-2x speedup despite 8 cores. Why? Memory-bound! Cost of counting grid hits > cost of actual calculations.

    All-GPU version achieved ~600 Mpoints/sec. Wow!

    In conversationMonday, 09-Oct-2023 03:07:44 JST from hachyderm.iopermalink
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.