Posts tagged tech
Last week The Document Foundation blog announced the release of the LibreOffice 7.0 Getting Started Guide in Brazilian Portuguese. This new guide is based on the English language guide released last month (posts passim).
In fact the Brazilian Portuguese guide is based on the English version. Its basis was a machine translation of the English guide which was then revised by members of the LibreOffice Brazilian community. Future editions of the Getting Started Guide will be done without translation, but by writing directly in Portuguese about new features in LibreOffice and information about the suite.
Like its English counterpart, the Brazilian Portuguese Getting Started Guide outlines the development of LibreOffice and introduces each of its modules: spreadsheets (Calc), presentations (Impress), vector drawings (Draw), text processing (Writer), equations (Maths) and databases (Base). In addition to these modules, there are several chapters describing important concepts common to all modules such as styles, printing, electronic signing, macros, exporting in various formats, redacting and document classification.
Contributors to the new guide were Vera Cavalcante, Jackson Cavalcanti Jr., Timothy Brennan Jr., Flávio Schefer, Felipe Viggiano, Raul Pacheco da Silva, Túlio Macedo and Olivier Hallot.
The new Brazilian Portuguese LibreOffice 7.0 Getting Started Guide can be downloaded in PDF format.
In addition to the new guide, the Brazilian LibreOffice Community also produces its own LibreOffice magazine.
Dortmund’s city council has paved the way for “Public Money? Public Code!” In the future, software developed or commissioned by the administration will be made available to the general public, the Free Software Federation Europe (FSFE) reports.
Back in February, the city council approved a motion previously submitted by the SPD, Bündnis90/Die Grünen, CDU, Die Linke+ and FDP/Bürgerliste. In the future, Free Software is to be used wherever possible and software developed or commissioned for development by the administration is to be made available to the general public.
Matthias Kirschner, President of the Free Software Foundation Europe states: “We are happy that the DO-FOSS initiative was able to convince the city of Dortmund of the principle of “Public Money? Public Code”. Free Software gives everyone the right to use, study, share and improve software for any purpose. These freedoms also benefit administrations. Public administrations that follow this principle can benefit from numerous advantages: Collaboration with other government agencies, independence from individual vendors, potential tax savings, innovation and a more solid basis for IT security. The Council’s decision means that there is now the political backing to gradually break down dependencies on proprietary vendors. We will accompany the implementation and at the same time call on other administrations in Germany and Europe to follow Dortmund’s example.“
SUSE was the first Linux distribution I actually used as a day-to-day working system over 15 years ago. It was the distribution on which I learnt about Linux, so it has a special place in my affections.
The impetus to install it came from a friend who bought a set of 5 installation CDs off eBay for me as a present.
Later on, I treated myself to SUSE Linux Professional 9.3 for some £50. It came as a box set of 2 DVDs and 5 CDs, along with a doorstep-sized manual.
SUSE is a good, solid distribution and excellent for business use with its SUSE Enterprise Linux server and desktop offerings and paid-for support.
SUSE also sponsors the community-supported openSUSE project, which develops the openSUSE Linux distribution, which is available in both rolling release (Tumbleweed) and regular release (Leap) versions.
Founded in Germany 1992, SUSE was the first company to market Linux to business. Over the years its ownership has changed many times. In 2004 it was acquired by Novell. Novell and with it SUSE were then purchased by Attachmate (with financial assistance from Microsoft) in 2010. In 2014 Microfocus acquired Attachmate and SUSE was spun off as a separate division under the name SUSE Software Solutions Germany GmbH. Finally, EQT purchased SUSE from Micro Focus for $2.5 billion in March 2019.
News has now emerged that SUSE is being prepared for stock flotation in Europe in via an IPO in the next few months (May is mentioned as the earliest date) with Bank of America and Morgan Stanley executing the IPO with the aid of Goldman Sachs, Deutsche Bank, and Jefferies.
According to Le Monde Informatique, SUSE is likely to have a market valuation of €7-8 bn. for the IPO.
This release over contains 90 bug fixes and improvements to document compatibility.
As usual, TDF suggests that this release is aimed at technology enthusiasts and power users, rather than more conservative business users for whom an older release is recommended.
This latest release is available for all major desktop operating systems (Linux, MacOS and Windows), mobile platforms (Android and iOS) and the cloud. Instead of downloading via the link above, Linux users might like to wait until the update is provided directly via the repositories of their individual distributions, whilst those for mobile devices can be obtained via the app stores for their respective operating systems.
LibreOffice users are invited to join the community so they can both get and provide individual support. Those willing to contribute their time and professional skills to the project can visit the dedicated What Can I Do For LibreOffice website.
Finally, LibreOffice users, free software advocates and community members can give financial assistance to The Document Foundation with a donation via PayPal, credit card or other means.
Amazon was forced to apologise and blamed a “technical error” for a customer being unable to post a review in Welsh of a novel written in Welsh, Wales Online reports.
Cathryn Sherrington of Cardiff had submitted a Welsh Language review which she then translated to English of the book Lladd Duw, by Dewi Prysor.
The book is described by its publisher as a “hefty, ambitious novel set in London and an imaginery [sic] seaside town. It deals with the destruction of civilisation from the standpoint of the working class. An intense, dark novel but with the usual humour from Dewi Prysor.“
Cathryn’s review reads as follows:
Gwych Brilliant. I haven’t read a Welsh book for years – sometimes the formality of written Welsh puts me off – this is brilliant though.
Hawdd i ddarllen, stori gyffroes, cymeriadau diddorol. Wedi joio fo gymaint dwi’n mynd i ddarllen mwy o lyfrau Cymraeg.”
In English the review’s second sentence reads: “Easy to read, exciting story, interesting characters. Have enjoyed it so much I’m going to read more Welsh language books“.
However, Amazon which employs 1,000 people in Swansea, emailed Cathryn implying her review might have broken its guidelines.
There then followed a social media and email exchange between Cathryn and Amazon at the end of which the latter relented, stating: “This was due to a technical error for which we apologise. It has now been resolved.”
In a historic judgment in Italy, Lenovo was ordered to pay €20,000 euros in damages for abusive behaviour for refusing to refund the price of a pre-installed Windows licence in a case initiated by Free Software Foundation Europe (FSFE) supporter Luca Bonissi, the FSFE reports.
A grateful Luca is donating €15,000 of the award to the FSFE.
It should go without saying that everyone should be able to freely choose the operating system to run on their personal computers.However, this freedom is regularly abused by hardware suppliers to such an extent that it is almost impossible to buy a new or used system without having to pay the so-called Windows tax for an unwanted OS. Some computer manufacturers still make it very hard for consumers, forcing them to assert their rights in expensive and exhausting lawsuits.
This is what happened to Luca Bonissi.
In March 2018, Luca bought a brand-new Lenovo Ideapad and decided he didn’t want to run Windows on it. He therefore contacted Lenovo to request a refund for the pre-installed Windows system.
This initiated a lengthy two-year bureaucratic and legal all because the company twice refused to refund the €42 Luca had been charged for the unwanted Windows system. After having his requests denied twice by Lenovo, Luca tried to seek help from the Italian Competition and Market Authority (AGCM). However, when he realised that these efforts were fruitless, Luca decided to take legal action against Lenovo.
He therefore initiated proceedings in a small claims court without legal assistance, but soon sought professional aid when Lenovo proved obstinate.
In June 2019, the Justice of the Peace of Monza upheld Luca’s right to reimbursement and ordered Lenovo to refund €42 for the Windows licence and also ordered the company to pay €130 in legal costs
However, Lenovo was dissatisfied with the verdict and appealed, citing 15 grounds for appeal, implicating Luca in further legal proceedings and yet more expense for legal advice.
Finally, in December 2020, the Court of First Instance in Monza rejected all Lenovo’s arguments, upholding the consumer’s right to a refund for the unused pre-installed operating system. The court noted that the manufacturer itself had expressly assumed this obligation in the Windows licence. Furthermore, in a historic decision, the court imposed punitive damages of €20,000 on Lenovo for abusing the appeal process.
Commenting on his victory in court Luca stated: “The Monza decision demonstrated that is possible to reverse the unacceptable behaviour of big techs. What was taken away from the Free Software community has now been returned to it. I encourage everyone to fight back for their legitimate rights!”
The Document Foundation’s blog announced last week that the LibreOffice Documentation Team had released its LibreOffice 7.0 Getting Started Guide. The Guide, which was previously issued for LibreOffice version 6.4, has been updated to include all the new and improved features of LibreOffice 7.0, the latest version of LibreOffice, the free and open source alternative to proprietary office suites.
The guide has been drafted especially for those wanting to get up to speed quickly with LibreOffice, whether they are new users of office productivity software or already have some familiarity with other office suites, such as Microsoft’s ubiquitous and expensive offering.
The guide provides an introduction the LibreOffice’s 6 major components, i.e.:
- Writer (word processing)
- Calc (spreadsheets)
- Impress (presentations)
- Draw (vector graphics)
- Base (database)
- Math (equation editor)
Furthermore, it also covers some of the features common to all those components – set-up and customisation, styles and templates, macro recording, digital signing and printing.
The guide can be downloaded (PDF format) from LibreOffice’s English Documentation site., which also includes links to documentation in other languages, as well as user guides for earlier LibreOffice releases.
Occasionally in recent weeks, this blog has provided information on keyboard shortcuts for unusual characters (unusual for English that is. Ed.) on a Linux keyboard.
The last of these took the umlaut (diaresis) as its subject (posts passim).
Today, attention turns once again to German and the s key, which can produce two characters, depending upon the combination of keystrokes.
Depressing the AltGr key and s produces “ß“, the German sharp s or esszett, usually transcribed in English as ss.
The other character that can be produced is “§“, which can be produced with the AltGr, Shift and s keys.
Known as the Section sign, it is believed to originate from the Latin signum sectionis, meaning section sign and usually turns up in with reference to legal documents.
Where more than one section of a legal text is involved, the sign is repeated, i.e §§.
The way a completed translation has been produced has changed markedly over the decades since my first days as a translator for Imperial Tobacco in Bedminster, Bristol.
In those days I’d write out the translation in longhand from printed source material and take my manuscript to the typing pool where it would be transformed into typescript.
The next big change came with my learning how to touch-type. By this time I was a freelance with no more access to a typing pool.
In my early freelance days, it was rare to get editable copy that one could overkey with one’s usual word processor, spreadsheet or presentation package. The standard way of working was still from hard copy propped up in a copyholder alongside one’s keyboard.
Then there came a large surge in the use of formats such as PDF – Portable Document Format. This format enables documents, including text formatting and images, to be presented in a manner independent of application software, hardware and operating systems.
If the PDF was text-based, one could simply export the text as plain ASCII text or copy and paste it into a word processor.
However, if I had an image-based PDF to work with, my usual answer was to print it out as hard copy to be propped up in a copyholder alongside my keyboard. This was very expensive in terms of paper and other consumables for the printer, even with a machine as parsimonious as my trusty mono laser printer, whose cartridge was good for printing 3,000 or so pages of copy.
In addition to the expense of printing, there was a far greater drawback to bear in mind, i.e. one could easily miss a sentence or paragraph from the original text when keying in the translated from a hard copy original, with the consequent implications for the quality of the finished work and the client’s satisfaction with it.
Then I discovered OCR – Optical Character Recognition – the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text.
Here’s a short video explaining the basics of OCR.
gImageReader provides a simple graphical front-end to the tesseract OCR engine. The features of gImageReader include:
- Importing PDF documents and images from disk, scanning devices, clipboard and screenshots;
- Process multiple images and documents in one go;
- Manual or automatic recognition area definition;
- Recognising to plain text or to hOCR documents;
- Recognized text displayed directly next to the image;
- Post-processing of the recognised text, including spellchecking;
- Generating PDF documents from hOCR documents.
I generally just stick scanning the input file to plain text, which can then be fed into a regular office suite for translation. If your office suite can handle HTML that’s the format gImageReader outputs as its hOCR output.
The tesseract OCR engine mentioned above can also be enhanced with language packs for post-recognition spellchecking, as mentioned in the features above. At present, tesseract can recognise over 100 different languages.
In addition to GUI-based OCR, there are also Linux packages available which can perform OCR via the command line interface.
My tool of choice here is OCRmyPDF.
OCRmyPDF is a package written in Python 3 that adds OCR layers to PDFs and, like gImageReader, also uses the tesseract OCR engine.
Using OCRmyPDF on the command line is simplicity itself (as shown in the screenshot above:
ocrmypdf -l [language option] inputfile.pdf outputfile.pdf
More complicated command options are possible, but after using that simple string above, you’ll be able to extract the text from your formerly image-based PDF ready for translation.
By way of conclusion depending on the software itself, OCR packages can also extract text from images such as .jpg files.
Google’s developers evidently have a sense of humour, as the search below shows.
Not all humour from techies is quite so obvious to ordinary mortals and is normally deeply buried in comments in code, mark-up and the like.
Tip of the hat: Kevin Mills