On the libraryâs side, some people thought Ginsparg was too hands-on. Others said he wasnât patient enough. A âgood lower-level manager,â according to someone long involved with arXiv, âbut his sense of management didnât scale.â For most of the 2000s, arXiv couldnât hold on to more than a few developers.
There are two paths for pioneers of computing. One is a life of board seats, keynote speeches, and lucrative consulting gigs. The other is the path of the practitioner who remains hands-on, still writing and reviewing code. Itâs clear where Ginsparg standsâand how anathema the other path is to him. As he put it to me, âLarry Summers spending one day a week consulting for some hedge fundâitâs just unseemly.â
But overstaying oneâs welcome also risks unseemliness. By the mid-2000s, as the web matured, arXivâin the words of its current program director, Stephanie Orphanâgot âbigger than all of us.â A creationist physicist sued it for rejecting papers on creationist cosmology. Various other mini-scandals arose, including a plagiarism one, and some users complained that the moderatorsâvolunteers who are experts in their respective fieldsâheld too much power. In 2009, Philip Gibbs, an independent physicist, even created viXra (arXiv spelled backward), a more or less unregulated Wild West where papers on quantum-physico-homeopathy can find their readership, for anyone eager to learn why pi is a lie.
Then there was the problem of managing arXivâs massive code base. Although Ginsparg was a capable programmer, he wasnât a software professional adhering to industry norms like maintainability and testing. Much like constructing a building without proper structural supports or routine safety checks, his methods allowed for quick initial progress but later caused delays and complications. Unrepentant, Ginsparg often went behind the libraryâs back to check the code for errors. The staff saw this as an affront, accusing him of micromanaging and sowing distrust.
In 2011, arXivâs 20th anniversary, Ginsparg thought he was ready to move on, writing what was intended as a farewell note, an article titled âArXiv at 20,â in Nature: âFor me, the repository was supposed to be a three-hour tour, not a life sentence. ArXiv was originally conceived to be fully automated, so as not to scuttle my research career. But daily administrative activities associated with running it can consume hours of every weekday, year-round without holiday.â
Ginsparg would stay on the advisory board, but daily operations would be handed over to the staff at the Cornell University Library.
It never happened, and as time went on, some accused Ginsparg of âbackseat driving.â One person said he was holding certain code âhostageâ by refusing to share it with other employees or on GitHub. Ginsparg was frustrated because he couldnât understand why implementing features that used to take him a day now took weeks. I challenged him on this, asking if there was any documentation for developers to onboard the new code base. Ginsparg responded, âI learned Fortran in the 1960s, and real programmers didnât document,â which nearly sent me, a coder, into cardiac arrest.
Technical problems were compounded by administrative ones. In 2019, Cornell transferred arXiv to the schoolâs Computing and Information Science division, only to have it change hands again after a few months. Then a new director with a background in, of all things, for-profit academic publishing took over; she lasted a year and a half. âThere was disruption,â said an arXiv employee. âIt was not a good period.â
But finally, relief: In 2022, the Simons Foundation committed funding that allowed arXiv to go on a hiring spree. Ramin Zabih, a Cornell professor who had been a long-time champion, joined as the faculty director. Under the new governance structure, arXivâs migration to the cloud and a refactoring of the code base to Python finally took off.
One Saturday morning, I met Ginsparg at his home. He was carefully inspecting his sonâs bike, which I was borrowing for a three-hour ride we had planned to Mount Pleasant. As Ginsparg shared the route with me, he teasinglyâbut persistentlyâexpressed doubts about my ability to keep up. I was tempted to mention that, in high school, Iâd cycled solo across Japan, but I refrained and silently savored the moment when, on the final uphill later that day, he said, âI mightâve oversold this to you.â
Over the months I spoke with Ginsparg, my main challenge was interrupting him, as a simple question would often launch him into an extended monolog. It was only near the end of the bike ride that I managed to tell him how I found him tenacious and stubborn, and that if someone more meek had been in charge, arXiv might not have survived. I was startled by his response.
âYou know, one personâs tenacity is another personâs terrorism,â he said.
âWhat do you mean?â I asked.
âIâve heard that the staff occasionally felt terrorized,â he said.
âBy you?â I replied, though a more truthful response wouldâve been âNo shit.â Ginsparg apparently didnât hear the question and started talking about something else.
Beyond the dramaâif not terrorismâof its day-to-day operations, arXiv still faces many challenges. The linguist Emily Bender has accused it of being a âcancerâ for the way it promotes âjunk scienceâ and âfast scholarship.â Sometimes it does seem too fast: In 2023, a much-hyped paper claiming to have cracked room-temperature superconductivity turned out to be thoroughly wrong. (But equally fast was exactly that debunkingâproof of arXiv working as intended.) Then there are opposite cases, where arXiv âcensorsââso say criticsâperfectly good findings, such as when physicist Jorge Hirsch, of h-index fame, had his paper withdrawn for âinflammatory contentâ and âunprofessional language.â
How does Ginsparg feel about all this? Well, heâs not the type to wax poetic about having a mission, promoting an ideology, or being a pioneer of âopen science.â He cares about those things, I think, but heâs reluctant to frame his work in grandiose ways.
At one point, I asked if he ever really wants to be liberated from arXiv. âYou know, I have to be completely honestâthere are various aspects of this that remain incredibly entertaining,â Ginsparg said. âI have the perfect platform for testing ideas and playing with them.â Though he no longer tinkers with the production code that runs arXiv, he is still hard at work on his holy grail for filtering out bogus submissions. Itâs a project that keeps him involved, keeps him active. Perhaps, with newer language models, heâll figure it out. âItâs like that Al Pacino quote: They keep bringing me back,â he said. A familiar smile spread across Ginspargâs face. âBut Al Pacino also developed a real taste for killing people.â
Let us know what you think about this article. Submit a letter to the editor at mail@wired.com.
+ There are no comments
Add yours