Oct 26 2008

Microsoft’s collaboration stuff any day now?

It’s Monday morning on October 27th as I write here in Australia.

Steve Balmer gave hints in 2 separate reports at the beginning of the month that they’ll be announcing their in-Office collaboration stuff this week.

The first report was in www.cio.co.uk

Ballmer:So we are embracing Software + Services, Cloud Computing as hard as anybody. By the time we finish our Professional Developers Conference this month, I think you’ll have to say that there is nobody out there with as wide a range of Cloud Computing services as Microsoft, including, dare I say it, Google …

CIO: Steve, I guess the $64,000 question from a lot of people’s point of view is, is there going to be an Office for the Web, something that really competes head on with Google Docs, Google Apps?

Ballmer: .. I think what people want is something as rich as Microsoft Office, something that you can ‘click and run’, if you are not at your own desk. Something that is compatible, document-wise with Microsoft Office and something that offers the kind of joint editing capabilities that is nice in Google Docs and Spreadsheets. Will Microsoft Office offer that? Yes! Standby for details in the next month.

CIO: So, in the backend of Microsoft R&D, are there people beavering away at versions of Word, PowerPoint, Excel, etc, that are purely web based? Or, is it always going to be this hybrid?

Ballmer: What does it mean to be purely Web based? Do we want them to be as only as powerful as ‘runs in a browser’? No. We want software that is more powerful than runs in a browser. Does that mean we will not have some neat stuff that does run in the browser? No.

We think you’ll actually want the full power of Word, Excel and PowerPoint – and you’ll want to be able to get that simply. But, if you just happen to be in an Internet cafe kiosk and you want to do some light editing, perhaps we need to have a way to support you in that as well, inside the browser. ..

In another, in response to a question about Office Live, he said:

“Office Live has a few things left it needs to do. Number one, and probably most important, is to make sure that people using Office have greater ability to collaborate with one another. We have some of that today with [Office Live] Workspaces, as well as that we’ve got SharePoint; we can do more and some of those things will be better than the other alternatives.

Number two, is when we do Office Live, it has to be true to Office; you’ll need to be able to have full Office documents and programs and share them.

Number three, we have to make it so that – most people use Office most of the time from a single machine. But if you’re away from your desk, at a cafe, a kiosk or your school library, and you don’t have Office, you’ll want to be able to do something quickly; we have to make sure you can get it easily, stream it down, put it in a browser, something like that there… details coming in a few weeks.

I’m not going to write here what I think they are likely to announce.  More sensible to wait a little longer.  It will be interesting though to see what is available immediately, and how much is just vaporware.

Oct 20 2008

Plutext walkthrough

PlutextWalkthrough (PDF) is a step by step guide to collaborating on a Word document using Plutext.

It contains more or less the same information as my last blog post, but in a format which allows you to avoid the videos.

Oct 01 2008

Collaboration in Word – ready for alpha testing

Plutext enables everyone on your team to make changes in Word, at the same time (ie it lets you collaborate just as you can in Google Docs, but in your familiar Word environment, with formatting, change tracking etc).

Here is a short screencast of the gist of it:

Get the Flash Player to see this player.

If you are working on legal documents, government reports, or other formal deliverables you’ll probably want to make the process more structured.  Here is an excerpt from an old screencast showing our features for lawyers and others requiring accountability:

Get the Flash Player to see this player.

If you want to give it a try, the easiest way to try it out is to download our Word 2007 add-in, then fire up Word and login to the “public0902″ group with these “tester” settings (on Word’s review ribbon, click our “File” button, then Settings), using password “tester”:


Click to enlarge

then open an existing document (from the Plutext “File” button on the Review ribbon).

You can get a colleague to work with you on a document. Or you can simulate collaboration simply by opening the document twice on your PC (which is what I’ve done in the screencasts above).

Right now, you need Word 2007.  Next week, we’ll release an updated build of our cross-platform client which you can try.

This video shows you how to add your own document to the public space (or your private space):

Get the Flash Player to see this player.

But be careful, anyone else can see the documents if you just use the “public” group.

If you’d like a little privacy, you can setup a space of your own on our test server.

We’d love to know what you think, either in the comments, or our forums, or privately (jason@plutext.org).

Please report problems with the Word add-in here, and server problems here.  Thanks.

Naturally, there are a few limitations in this alpha, including:

  • the Audit function doesn’t like bookmarks
  • adding an image won’t work

Finally, if you want to uninstall the Word add-in, you can do this from Window’s add/remove programs in the usual way.

Jul 22 2008

docx4j v2.0 released

We’re pleased to announce that we’ve released v2.0 of docx4j.

docx4j is an open source Java library for manipulating OpenXML WordprocessingML documents, released under the Apache software licence. docx is the default file format in Word 2007 in Microsoft Office 2007.

docx4j supports the following:

  • Open existing docx (from filesystem, SMB/CIFS, WebDAV using VFS)
  • Create new docx (just one line of code)
  • Programmatically manipulate the docx document (of course), including tables, images
  • Import a binary doc (proof of concept)
  • Import/export Word 2007′s xmlPackage (pkg) format
  • Save docx to filesystem as a docx (ie zipped), or to JCR (unzipped)
  • Apply transforms, including common filters
  • Export as HTML or PDF
  • Diff/compare paragraphs or sdt (content controls), outputting OpenXML with changes marked up
  • Font support (font substitution, and use of any fonts embedded in the document)
  • Use the power of JAXB to do other cool stuff

Get it from here.

What is it about this release that warrants being labeled v2.0?

The new features include image support, diff, and xmlPackage.  A factor is the version numbering convention Microsoft has chosen for their Open XML SDK: its v2.0 which will first contain an API for WordprocessingML.

So think of a “level 1″ API as one which handles the Open Packaging conventions (basically, the unzipping step), but leaves you to handle the document (part) content using low level XML (DOM, SAX, etc).

A “level 2″ API is one which gives you a higher level API to manipulate the part content.  At the very least, this would include objects to represent paragraphs, tables, styles etc.  But you’d also expect it to be easy, for example, to add a paragraph using a specified style (maybe this is “level 3″?  In any case, docx4j can do it)

Given that docx4j brought a “level 2″ WordML API to the Java world 6 months ago, it is appropriate that it be labelled version 2.0.

Jul 14 2008

“Document locked” – never again!

Last Thursday I demo’d our Plutext collaboration system to an audience of lawyers and legal technologists and some old friends at the Victorian Society for Computers & the Law’s Legal Technology Conference 2008.

The accompanying presentation is here (pdf).

Our approach to collaboration means you will never be told your document is locked or checked out by someone else.

This in itself is a great step forward for many long-suffering users of traditional document management systems.

I’m collecting screenshots of locked / checked-out messages from different document management systems.  So next time this happens to you, please email it to me.  I’m jason, that’s at plutext.org.  Thanks.

May 03 2008

Click to try docx4all v0.2

Jo and I are pleased to have just uploaded a new version of docx4all for you to try.

We’ve added quite a few features since I last blogged about docx4all (21 Feb).

New features include:

The VFS file chooser allows docx4all to open documents not just from the local file system, but also from a WebDAV server (such as Alfresco), and potentially, CIFS etc.  To do this, docx4all uses VFSJFileChooser, and webdavclient4j (a project we’ve started to address the gap left when Apache retired Slide, including its WebDAV client).

The incoming document filter is used to convert certain features of WordprocessingML which docx4all can’t yet handle, into something it can.   Examples include proofErr, hyperlink, and lastRenderedPageBreak.  This behaviour relies on a feature of docx4j, which makes it easy to apply a transform to a docx package (by converting it to pkg:package format).

Docx4all can’t yet render tables (let alone edit them), but we’re working on changing that.

Apr 30 2008

modified Office Open XML schema now in Subversion

We’ve been tweaking the schemas – especially wml.xsd – to make the Java classes generated by JAXB’s xjc more user-friendly.

I’m satisfied that this is permitted by ECMA, so I’ve put the modified schemas into subversion .

For anyone interested in the reasoning, the Ecma website says:

“Ecma Standards and Technical Reports are made available to all interested persons or organizations, free of charge and copyright, in printed form and, as files in Acrobat (R) PDF format.”

For this to apply, it needs to be an “Ecma Standards or Technical Report”.

That page says “A Standard or a Technical Report is a formal document prepared by an Ecma Technical Committee and approved by the Ecma General Assembly.”

Office Open XML was so approved.

So the only possible glitch would be words to the effect that the schema aren’t part of the official standard.

I’ve checked the language in parts 2 and 4 (of the Ecma TC45 Final Draft) which says “This Office Open XML specification includes a family of schemas … The normative definition of these schemas reside in an accompanying file named … which is distributed in electronic form only.”

Which makes it clear the schemas are part of the Standard :)

So the ECMA standard’s XSD are “free of copyright” – an explicit waiver of copyright. So no problemo in creating derivative works.

Apr 10 2008

docx4j now released under Apache License

We’re pleased to announce that docx4j is now available under the Apache License (v2).

This is a response to feedback on an earlier post.  This is also the last license change we’ll be making to docx4j. Word documents are mostly manipulated in corporate environments.  This change removes barriers to adoption of docx4j by business and institutions.

docx4j uses org.merlin.io to efficiently turn streams inside out. That package had been available under the GPL.  Its author, Merlin Hughes, today kindly released it under v2 of the Apache License, so we now use it under that license.

There’s a new nightly build of docx4j available from the downloads page if you want to grab it.  This build can load/save to/from a WebDAV server – more on that in another post.

Mar 18 2008

Sun’s bug votes on steroids

I like programming in Java.  It is still a great way to write cross-platform code.  I’ve bet my business on it.

But sometimes, Sun is just too slow to fix bugs (or make the fixes available). And this is still their role, even when a user has a fix to contribute.

Take the following 2 which have bitten me this week:

  1. Preferences broken if you use org.apache.xalan.processor.TransformerFactoryImpl
  2. Printing on Ubuntu 7.10

Fixes haven’t become available for either of these yet on Java 6 (though the first has been closed  here and here)

Sun really needs to invest more in Java, to get all the outstanding bugs fixed, and the fixes out quickly.  (Yes, people who write and use open source expect fixes more quickly than most vendors can deliver them.  Life is much quicker in the open source world)

But as we know, Sun doesn’t make much money from it – directly at least.  And even though Sun is quite clear in their strategy to use Java to drive sales of their hardware, this lack of revenue shows – shows up as a lack of support.

So what about a logo people using Java can put on their websites, which communicates “I bought some Sun hardware to support Sun’s investment in Java” to other people who use Java.  This may make them consider buying some Sun gear as well, and proliferation of the logo would remind Sun that Java really is what butters their bread.

Maybe that needs to be  “to support Sun’s investment in Java on Linux” (or even on Linux x86_64) – since its not Windows that these bugs occur on.

Or how about a way for Sun to earn credits towards a Sun hardware purchase: “If Sun fixes this bug, it will earn them a notional half a purchase”.  Fix this one as well, and I’ll buy something.  A great little site for someone to write.

Yes, I know you can vote for a bug (the printing bug has 45 votes since 26 November 2007 – that’s a lot of votes, comparatively speaking – but still there is no indication of when a fix will be available).

But Sun is wildly optimistic in only giving people three votes, no matter how many bugs are causing them grief.

I’ve bought 3 servers, 2 workstations, and a laptop in the last 6 months or so, and none of these are from Sun.  But I would change my purchasing policies for some tangible indication that result in quicker bug fixes.  So my third idea in this little brainstorm – what about allocating special higher priority bug votes when  people buy Sun gear?

Mar 12 2008

Office Online – not yet after all

Well, there were a few interesting announcements from Microsoft last week, but they didn’t include OaaS (Office as a Service), nor improved collaboration.

The three announcements:

  1. Office Live Workspace Beta is publicly available
  2. Sharepoint Online has been available to businesses with over 5000 employees; now it is available in beta to businesses with under 5000 (provided you are based in the US)
  3. Silverlight 2 Beta

Office Live Workspace doesn’t have real collaboration, yet. As ReadWriteWeb puts it:

Although Office Live Workspace allows for collaboration, it’s not real-time, online collaboration. Instead, if one user is editing a file, another will be informed the file is “checked out.” When they finish editing and save their changes the document is checked back in for other users to access.

The situation is similar in Sharepoint. As Bill Gates put it:

I have a Word document that if I open it up, you can see that I’ve been force [sic] versioning, check in/check out on my documents, so I could check out the document, make a change, and then come down and save those changes

Mr Gates explained that between these 2 products Microsoft intends to cover the whole market:

We want to scale [Sharepoint] all the way down, so that literally you don’t have to have an IT capability, and that’s where we get into what we’ve branded Live. So we’re working that one up through small customers. We want to work [Sharepoint] down and make sure there’s no gap in-between.

When Microsoft eventually gets around to offering real collaboration, there is no reason for either of those 2 products to do it differently (unless they wish to upsell people to Sharepoint).  So its more a question of which one gets real collaboration first; Sharepoint customers are probably more deserving, but Office Live Workspace customers might make good guinea pigs.

“Microsoft Office Live Workspace is being offered free of charge. .. The company expects to release the final public version of Office Live Workspace later in the year.”

That’s not to say that real collaboration will necessarily be free, though it might be.

For hosted Sharepoint (Microsoft Online Services), the licensing model:

New customers and customers without Microsoft Software Assurance can purchase Microsoft Online Services as a per-user subscription. Existing customers with Software Assurance on their Microsoft Client Access Licenses can purchase a user subscription at a discount, enabling them to maximize their existing Microsoft software investments. Customers with a subscription have rights to both Microsoft Online Services and to access on-premises server software, giving them the ability to blend Web-based services with on-premises software.

So when will the collaboration offering happen?

Venture Beat says that “with Microsoft still raking in so much money from traditional software, [full-on war with Google Apps is] still at least a couple years away”.  Mary Jo tells us Microsoft will fill in the blanks around its Live services strategy at its Professional Developers Conference in October.

Which brings me to the Silverlight 2 beta.  I’m inclined to think Microsoft will offer real collaboration as soon as they’ve got a suitable client (ie not before Silverlight 2 has been through its beta cycle).  The TextGlow docx viewer sets high expectations as to how this might perform.