Paul Krill
Editor at Large

Searchcode brings regex to code search

news
Dec 1, 20152 mins

The engine's developer wants to expand its language capabilities and double its index of repositories

Searchcode, a source code and documentation search engine, is looking to add regular expression capabilities and double the number of repositories indexed.

With regular expression support, Searchcode could get a power boost in line with Google’s now-defunct Code Search application, project developer Benjamin Boyter, said in an email. “Regular expression support would allow developers [the] most power possible to search over code,” he said, noting that highly complex queries not possible in existing code search engines would be enabled, he said.

But developing a regular expression capability could be “cost-prohibitive,” Boyter said. Support, however, could come from sales of a possible server version of Searchcode. “A downloadable version has the advantage that private repositories can be indexed, including ones that exist in private hosted Bitbucket servers. It would allow companies to index their code without exposing it on the public Internet in any way.”

Searchcode, which has been around for about five years, indexes API documentation, code snippets, and open source software repositories. It downloads source code from repositories like GitHub, BitBucket, CodePlex, and SourceForge, then processes it to produce metadata, such as the number of lines, language, and code comments. “It started out as a side project,” Boyter said. “I had always wanted to build my own search engine, but the capital expenses are cost-prohibitive. So I started building documentation search over programming APIs.” Searchcode has its own free API.

SearchCode works on languages ranging from C to C#, Dart, Go, Java, JavaScript, Objective-C, PHP, and Ruby. Boyter wants to expand its language capabilities, double its index of around 7.5 million repositories, and index the publicly available Git repositories hosted by GitLab.

Currently, there are ads on the site to cover basic hosting costs, said Boyter. “Developers tend to be in the group that is not very receptive to advertising,” said Boyter. “As such, I will not be looking to make the site a profit machine.”

Paul Krill

Paul Krill is editor at large at InfoWorld. Paul has been covering computer technology as a news and feature reporter for more than 35 years, including 30 years at InfoWorld. He has specialized in coverage of software development tools and technologies since the 1990s, and he continues to lead InfoWorld’s news coverage of software development platforms including Java and .NET and programming languages including JavaScript, TypeScript, PHP, Python, Ruby, Rust, and Go. Long trusted as a reporter who prioritizes accuracy, integrity, and the best interests of readers, Paul is sought out by technology companies and industry organizations who want to reach InfoWorld’s audience of software developers and other information technology professionals. Paul has won a “Best Technology News Coverage” award from IDG.

More from this author