Quick post … I’m excited to note that I’ll be joining Cray, the preeminent HPC company, to help develop solutions for HPC customers to consume supercomputing resources in the cloud. I start the week of 22 April. More soon, but I gotta say, I’m quite excited about this!
So I saw this, and yes, it is quite funny. There’s a discussion of this at HackerNews, which seems to follow a number of conventional pathways. Most of them miss the obvious implied humor.
A short note – today was my last day with Joyent. They are a wonderful company, building great things. Excellent technology, and technologists. I wish them nothing but success. For the immediate future, I’ll be working on consulting projects, as well as looking for the next great opportunity within high performance computing, storage, cloud. I’m … Read more Onward and upward in #HPC
Yeah … this was a fun one. Because I only recently started using a holistic VM management/control plane for my home machines, I didn’t notice that I had 2 VMs of the blog running. I was doing some surgery to fix something, then tailed the logs … and didn’t see the traffic. Took me a … Read more Note to self: have only one blog VM running
So this happened on the 3rd, on one of my systems:
Feb 3 03:02:39 calculon kernel: [195271.041118] INFO: task kworker/20:2:757 blocked for more than 120 seconds.
Feb 3 03:02:39 calculon kernel: [195271.048116] Not tainted 4.20.6.nlytiq #1
Feb 3 03:02:39 calculon kernel: [195271.052678] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Feb 3 03:02:39 calculon kernel: [195271.060626] …
Read more Data loss, thanks to buggy driver or hardware
The systemd article is on LWN, and discusses the “tragedy” of it. The ZFS post was linked from HackerNews and discusses risk to ZFS’s future from the perspective of FreeBSD leveraging ZFS on Linux as its upstream. Ok, first onto systemd. For those who don’t know systemd, think of it as the borg that ate … Read more Interesting articles on systemd and ZFS
Looking back on past reviews from 2013 and a few other posts, and what has changed since then up to 2019 (it’s early, I know), I am struck by a particular thought I’ve expressed for decades now. In 2009 I wrote: HPC has been moving relentlessly downmarket. Each wave of its motion has a destructive … Read more Reflections on where we’ve been in HPC, and thoughts on where we are going
I’ve been saying this for mumble decades. What I mean by “designed to fail” isn’t specifically that someone wants a system to fail. Rather, by various interactions, wishful thinking, drinking of one’s own kool-aid, a system is placed on an inexorable path to failure. Without something to divert it in time, failure is the most … Read more Systems that are designed to fail, often do
This is a fun one I’ve been playing with for the last 5 days or so. I’m helping someone out with backups, and they changed their mind on what they wanted backed up. So I started deleting the backups they didn’t want. One of the machines contained a set of directories for hashdeep which includes … Read more A bug in s3 buckets with no apparent way to request support to deal with it
Tiburon specifically solves the problem of stateful vs stateless boots, rolling forward/backward in images, and consistent booting with immutable images. Coupled with an image generator and a programmatic config environment (as in Nyble and other tools), you have the workings of the non-storage/networking parts of a cloud or cluster manager. The philosophy behind this has … Read more Finally posted Tiburon on github